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INTRODUCTION  AND  SURVEY 


"The  concept  of  hierarchic  order  occupies  a  central  place  In 
this  book,  and  lest  the  reader  should  think  that  I  am  riding 
a  private  hobby-horse,  let  me  reassure  him  that  this  concept 
has  a  long  and  respectable  ancestry.  So  much  so,  that  defend¬ 
ers  of  orthodoxy  are  inclined  to  dismiss  it  as  'old  hat'  - 
and  often  in  the  same  breath  to  deny  its  validity.  Yet  I 
hope  to  show  as  we  go  along  that  this  old  hat,  handled  with 
some  affection,  can  produce  lively  rabbits." 

A.  Koestler 

(The  Ghost  in  the  Machine) 


It  is  in  economic  theory  that  aggregation  of  variables  has  been  most 
explicitly  used  es  a  technique  to  study  and  evaluate  the  dynamics  of  sys¬ 
tems  of  great  sire  and  complexity.  This  technique  is  founded  on  the  idea 
that  in  many  large  systems  all  variables  can  somehow  be  clustered  into  a 
small  number  of  groups  so  that:  (i)  the  interactions  among  the  variables 
of  each  single  group  may  be  studied  as  if  interactions  among  groups  did  not 
exist  and  (ii)  interactions  among  groups  may  be  studied  without  reference 
to  the  interactions  within  groups.  This  is  trivially  correct  when  vari¬ 
ables  are  functions  of  the  values  of  variables  of  the  same  group  but  not 
of  the  values  of  any  variable  in  any  different  group.  The  system  in  this 
case  can  be  said  completely  decomposable:  it  truly  consists  of  independent 
subsystems,  each  one  of  which  can  be  analysed  separately,  without  reference 
to  the  others. 

It  is  worth  making  the  terminology  more  precise  at  the  outset:  such  a  sys- 
ten  may  be  represented  by  a  completely  decomposable  matrix,  i.e.,  a  square 
matrix  such  that  an  identical  permutation  of  rows  and  columns  leaves  a  set 
of  square  submatrices  on  the  principal  diagonal  and  zeros  everywhere  else; 
a  decomposable  matrix,  as  opposed  to  completely  decomposable,  is  a  matrix 
With  zeros  everywhere  below  the  principal  submatrices  but  not  necessarily 
also  above.  Neai'-oomplete-deoomposability  and  neav-decomposabi lity  are  de¬ 
fined  by  replacing  the  zeros  in  these  definitions  by  small  non-zero  numbers. 


II 


H.  A.  Simon  and  A.  Ando  [S161]  investigated  circumstances  under  which 
variable  aggregation  still  yields  satisfactory  approximations  when  the 
variables  of  a  group  do  depend  on  the  values  of  the  variables  of  the  other 
groups,  but  only  weakly  compared  with  intra-group  dependency.  Several 
examples  taken  from  economics  [Si61],  physics  [Si61]  [Si62][Si69],  and 
social  sciences  [Fi62/l],  indicate  that  systems  of  that  kind  are  likely  to 
be  more  frequently  encountered  in  reality  than  systems  verifying  the  assump¬ 
tion  of  complete  decomposability.  The  authors  of  [Si61]  show  that  in  these 
systems,  qualified  in  [An63]  as  Nearly  Completely  Decomposable  Systems ,  aggregation 
of  variables  separates  the  analysis  of  the  short-run  from  that  of  the  long- 
run  dynamics.  They  proved  two  theorems.  The  first  one  states  that,  pro¬ 
vided  inter-group  dependencies  are  sufficiently  weak  as  compared  to  intra¬ 
group  ones,  in  the  short-run  the  system  will  behave  approximately,  and  can 
therefore  be  analyzed,  as  if  it  were  completely  decomposable;  whatever 
standard  of  approximation  is  required,  there  will  always  exist  a  non-zero 
degree  of  weakness  of  dependency  such  that  the  analysis  will  meet  this 
standard  of  approximation.  The  second  theorem  states  that  even  in  the  long- 
run,  when  neglected  inter-group  dependencies  have  had  time  to  influence  the 
system  behavior,  the  values  of  the  variables  within  any  group  will  remain 
approximately  in  the  same  ratio  as  if  those  inter-group  influences  had  never 
existed.  The  results  obtained  in  the  short-run  will  therefore  remain  ap¬ 
proximately  valid  in  the  long-run,  as  far  as  the  relative  behavior  of  the 
variables  of  a  same  group  is  concerned . 

These  two  theorems  are  formally  introduced  in  the  first  section  in 
the  case  of  stochastic  systems;  our  approach  and  our  notation  only  deviate 
in  a  few  details  from  [Si61],  Furthermore,  we  indicate  how  aggregative 


variables  representing  fused  states  [Ve69]  may  be  used  at  an  arbitrary 
number  of  levels  of  aggregation  to  evaluate  the  limiting  equilibrium 
probability  distribution  of  stochastic  systems  with  a  large  number  of 

states. 

In  the  following  four  sections  we  exploit  the  concept  of  near-complete- 
decomposability  to  analyze  stochastic  networks  of  interconnected  queues. 

In  Section  II  we  define  a  model  of  a  network  which  is  a  particular  case 
of  W.  J.  Gordon's  and  G.  F.  Newell's  model  [Go67].  An  arbitrary  fixed 
number  of  customers  .make  use  of  an  arbitrary  fixed  number  (L+l)  of  re¬ 
sources,  each  of  which  provides  a  certain  type  of  service.  When  service  is 
completed  by  resource  S>,  a  customer  proceeds  directly  to  resource  m  with 
probability  p^.  The  conditions  under  which  such  a  system  is  nearly  completely 
decomposable  into  L  levels  of  aggregation  are  established  in  Section  III 
in  terms  of  the  resource  service  rates  and  the  transfer  probabilities  p^. 

We  show  in  Section  IV  how,  when  these  conditions  are  fulfilled,  the  set 
of  resources  may  be  organized  in  a  hierarchy  of  aggregate  resources , 
each  aggregate  resource  being  analyzable  merely  as  a  single  server  queuing 

system. 

This  approach  presents  several  advantages  which  are  discussed  in 

* 

Section  V.  First,  the  determination  of  the  equilibrium  marginal  prob¬ 
abilities  does  not  require,  as  in  [Go67],  the  inversion  of  /he  transfer 

matrix  lip  II.  Explicit  and  closed  form  solutions  are  obtained  for  these 

11  An 11  j 

probabilities,  as  well  as  for  the  average  time  a  customer  waits  at  each 

resource.  These  expressions  facilitate  the  estimation /bf  the  equilibrium 

probabilities  when  the  number  of  states  is  large,  i.e.,  in  the  cose  of  a 
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large  number  of  customers  and/or  resources.  Next,  these  solutions  apply 
to  queuing  networks  which  are  more  general  than  the  model  considered  In 
[  tio67];  they  may  be  used  to  evaluate  networks  In  which,  as  in  the  general 
model  of  J.  R.  Jackson  [Ja63],  the  service  rates  are  at  each  service  stage 
almost  arbitrary  functions  of  the  congestion  at  this  stage;  transfer  prob¬ 
abilities  dependent  on  the  congestions  at  the  stage  of  departure  may  be 
taken  into  account  as  well.  Finally,  this  hierarchical  model  of  aggregate 
resources  appears  to  be  a  good  approximation  to  multi-queue  systems  in 
which,  instead  of  being  exponentially  distributed,  the  service  times  are 
random  variables  with  arbitrary  distribution  functions;  in  other  words, 
near-complete  decomposability  can  under  suitable  circumstances  dispense 
with  the  Poissonian  service  times  assumption,  which  is  classically  made 
in  networks  of  queues  by  sheer  necessity  of  overcoming  analytic  diffi¬ 
culties. 

Networks  Of  interconnected  queues  are  in  several  respects  adequate 
models  for  studying  congestion  problems  in  multiprogramming  computing 
systems  [Wa66]  [Sm66]  [Ar71]  [Bu71].  We  show  in  Section  VI  that  the  char¬ 
acteristics  inherent  in  memory  hierarchies  are  of  such  a  nature  as  to  make 
near-complete  decomposability  an  intrinsic  property  of  their  models.  Hence,  con 
sidering  a  computer  memory  as  a  hierarchy  of  aggregate  memory  levels »  we 
use  the  model  set  up  in  Section  IV  to  define  and  evaluate  performance 
criteria  for  multiprogramming  computer  memory  hierarchies.  We  indicate 
also  how  its  ability  to  cope  rather  simply  with  state-dependent  transfer 
probabilities  may  render  this  model  sensitive  to  allocation  policies  which 
adjust  dynamically  the  space  allotted  per  program  in  each  memory  level  to 
the  demand  made  by  the  programs  in  the  course  of  their  execution. 


In  the  second  pert  of  this  seme  section,  we  discuss  the  similitude 
existing  between  the  hierarchical  model  of  aggregate  resources  defined  in 
section  IV  and  the  structure  of  levels  of  abstraction  advocated  by  B.  H. 
Dijkstra  for  the  software  of  multiprogramming  computer  systems  [D169/1], 
He  conjecture  that  the  criteria  that  Indicate  what  resources  to  aggregate 
are,  in  many  cases,  the  same  as  those  which  specify  what  the  levels  of 
abstraction  should  be.  The  conditions  for  aggregation  in  such  systems 
are  expressed  in  terms  of  parameters  closely  related  to  the  physical 
characteristics  and  the  usage  of  the  hardware  resources.  Hence,  provided 
these  parameters  may  be  assessed,  resource  aggregation  conditions  might 
help  the  designer  choose  the  ordering  of  the  levels  of  abstraction. 
Variable  aggregation  appears  in  these  two  models  as  a  technique  that  en¬ 
ables  the  level -by-level  evaluation  of  the  system  to  be  interlaced  with 
its  level-by-level  design.  This  stems  from  the  fact  that  the  conditions 
for  aggregation  are  the  same  as  those  necessary  to  provide  sufficiently 
approximate  knowledge  of  the  performances  of  a  still  incomplete  system 
on  which  further  design  decisions  may  be  based.  Aggregative  models  of 
queue  networks  are,  in  the  sphere  of  computing  system  analytical  models, 
a  counterpart  of  the  level -by-level  simulation  techniques  recommended  in 
[Pa67]  [Pa69]  [Zu68]  [Ra69]. 

The  last  section  is  intended  to  illustrate  the  use  of  an  aggregative 
model  in  analyzing  a  given  computing  system.  This  concrete  case  study 
reveals  that  aggregation  is  not  only  adequate  in  obtaining  numerical 
results  when  the  number  of  parameters  involved  is  large,  but  also  helps 
to  gain  Insight  and  conceptual  clarity  on  the  respective  parts  played  by 


VI 


th..e  parameters .  We  study  the  model  of  .  tlme-.h.rl„g  systm.  in  which 
memory  ap.ce  end  processor  time  ere  respectively  allocated  on  «  demand 
paging  end  .  multiprogramming  basis.  We  first  carry  out  the  analysis  of 
the  short-run  equilibrium  attained  by  the  p.ge  traffic  between  primary 
secondary  memory.  Several  Interesting  phenomena  are  then  revealed 
by  relating  these  result,  to  the  long-run  behavior  of  the  flow  of  trans¬ 
actions  between  the  set  of  user  consoles  end  the  entire  computing  system. 

We  show  that,  owing  to  the  fluctuations  of  processor  usage  with  the  degree 
of  smltlprogramnlng,  the  .orbing  condition,  of  .  p.gi„g  time-sharing  system 

"*y  te  elth"  BtabU  °l  »be„  the  regime  of  operations  1.  stable 

the  fluctuations  of  the  congestion,  measured  by  the  current  number  of  pending 
user  transactions,  damp  „„t  ..  .  r.sult  of  the  .lteratlo„B  they  ^ 

to  the  ratio  of  the  trensactlon  Input  rate  to  the  processor  usage  factor; 

In  the  unstable  regime,  these  congestion  fluctuation,  are  reinforced  by 
the  wey  they  alter  the  input  rate  to  processor  usage  ratio.  We  indicate 
how  these  stable  and  unstable  values  of  the  congestion  may  he  calculated 
from  a  set  of  parameter,  defining  the  computing  load  on  the  systmn  and 
the  page  traffic  between  primary  and  secondary  memory.  Moreover,  the 
analysis  reveals  that  below  .  certain  counting  load  the  average  congestion 
and  the  mean  system  response  time  Increase  less  than  linearly  with  the 
lead,  while  beyond  this  computing  load  they  increase  more  then  linearly, 
this  critical  computing  load  1.  extension  to  service  systems  with 
congestion-dependent  service  rate  of  the  saturation  point  defined  In 
[KiegJ.  These  notions  of  Instability  and  saturation  lead  to  a  more  complete 
definition  of  the  clrommt.nce.  under  which  the  severe  system  performance 
degradation  known  as  thrashing  [De68/2]  may  occur. 
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I.  NEARLY  COMPLETELY  DECOMPOSABLE  SYSTEMS 

We  introduce  in  this  first  section  two  theorems  on  nearly  completely 
decomposable  systems  proved  by  H.  A.  Simon  and  A.  Ando.  We  limit  our¬ 
selves  to  the  case  of  stochastic  systems,  which  is  the  only  case  relevant 
to  our  study.  Except  for  the  subscripts,  we  have  kept  the  same  notation 
as  in  [Si61].  We  show  afterwards  how  aggregative  variables  may  be  used 
to  represent  fused  states  [Ve69]  and  to  evaluate  the  limiting  state  prob¬ 
ability  distribution  in  stochastic  multi-level  nearly  completely  decompos¬ 
able  systems. 

1.1  The  Slmon-Ando  Theorems 

We  are  interested  in  stochastic  models  of  the  form 

x(t+l)  -  x(t)Q  (1.1) 

where  x(t)  is  a  row  probability  vector  and  Q  a  stochastic  matrix.  The 
system  having  n  possible  state,  x^(t)  is  the  unconditional  probability 
of  the  system  being  in  the  l  state  at  time  t;  is  the  conditional 
probability  that  the  system  is  in  the  t  state  at  time  (t+1)  given  that 
it  is  in  state  k  at  time  t. 

* 

We  consider  next  a  matrix  Q  ,  that  can  be  arranged  in  the  following 
form  after  an  appropriate  identical  permutation  of  rows  and  columns: 

*  i 

Q1 

*  1 


*  * 


1.1 


where  Che  Q's  are  square  matrices  and  the  remaining  elements,  not  dis- 
1  *  N 

played,  are  all  zero.  Let  nT  be  the  order  of  QT;  then  n  ■  E  n  .  The 

1-1  * 

following  notation  is  adopted  to  refer  to  the  elements  of  the  vector  x  (t): 


x*(t)  -  (>£(t)}  -  {[x*  (t)j,,..,[x*  (t)  ] . [x*  (t)]) 

11  4  N 

if  ^ 

where  [x  (t)]  is  a  vector  of  elements  of  {xf  (t)}  so  that  if 
1I  * 

if  if 

Xf  (t)  -  xk(t). 


1-1 

then  k  ■  D  q -ti ,  i"l ,. . .  ,n_| 

J-l  J  1 

1-1 . N. 

if 

The  matrix  Q  is  said  to  be  completely  decomposable,  it  is  clear 
that  in  the  system 

*  *  *t 

x  (t)  -  x  (0)  Q 


if  it 

the  subset  [x.  (t)1  depends,  for  any  t,  only  on  [x.  (0)]  and  Q  ,  and  is 
1  *  *  I 

independent  from  [x.  (t)]  and  Q_,  J^I. 

1J  J 

Let  us  now  consider  the  slightly  altered  matrix  Q  defined  by 

Q  ■  Q*  +  »  C  (1.2) 

* 

where  C  is  a  matrix  of  the  same  size  as  Q  which  has  the  property  of  keeping 
* 

both  Q  and  Q  stochastic  (i.e.,  elements  of  C  are  at  most  equal  to  unity  in 
absolute  value  and  their  rowsums  amount  to  zero),  and  where  c  is  a  very 
small  real  number  to  be  specified  later  in  ^1.5.  Matrices  of  the  form  of  Q  are 
defined  in  [An63 ]  as  being  nearly  completely  decomposable  matrices. 


1.2 


The  two  theorem*  to  be  Introduced  apply  to  the  dynamic  behavior  of 
systems  defined  by  (1.1)  with  Q  defined  by  (1.2).  Let  the  roots  of  the 

submatrix  Q_  be  designated  by  X.  >  i"l»...*n_.  We  assuue  that  these  roots 

1  *  *  *  *  * 
ere  distinct  and  so  arranged  that  X,  >  X?  >...>  X  •  Q  being  completely 

lI  *1  I  I 

decomposable  and  stochastic,  we  have 

X.  m  1#  1*1, 2,..., N. 

lI 

Let  us  also  define  ft  as  being  the  minimum  of  the  absolute  values  of  the 
differences  among  all  roots  of  Q  whose  values  are  not  unity  and  of  their 
differences  from  unity;  we  have  in  particular: 


(1.3) 


The  roots  of  a  matrix  being  continuous  functions  of  its  elements  (see  [W«b98] 

#44  f  or  [Bo07],  e.g.),  we  can  define  for  any  positive  real  number  ft,  however 

small,  a  small  enough  c  so  that,  for  every  root  of  Q  ,  X.  ,  there  exists  a 

I 

root  of  Q,  X.  *  such  that 

T 


i®l , .. . , n  j i  I*  1 , . . .  ,N. 


(1.4) 


Hence,  we  may  classify  the  roots  of  Q  so  that 


1 1-X.  |  <  ft*  I"1,...#N, 

I  (1.5) 

1 1-X^  |  >  6  “ft*  i"2,...,nj;  I"1,...,N, 


where  ft  approaches  zero  with  e. 


1.3 


Now,  assuming  all  roots  of  Q  to  be  distinct,  the  Sylvester  expansion 
of  Q  into  itself  gives  (see  e.g.,  [Fr52]  page  78): 


N  ni 


Q  -  E  EX,  Z(X,  ) 
1-1  i-1  XI  XI 


(1.6) 


where  Z(X,  )  *  II  (Q-X,  I  )/  II  (X,  -  X,  ),  J"1 . nji 

li  j,j  jj  ”  i.j  i 

i/h  i/h  Jml . "• 


The  matrices  Z(X  )  have  the  following  properties  (see  e.g.,  [Wed34]  page  25); 
iI 


[Z(X.  )]  »  Z(X.  ), 
1I  I 


Z(\±  )  X  Z(Xj  )  -  0n,  i/jj. 
X  J 


N  “i 

E  E  Z(X.  )  -  I  , 
1-1  i-1  iI  n 


where  0  is  an  nXn  matrix  of  zeros  and  I  the  nxn  unit  matrix, 
n  n 

On  the  other  hand  Q  may  also  be  written 


N  ni 

E  E  X,  u(iT)  v(iT), 
1-1  i-1  XI 


where  the  symbol  ~  denotes  the  transpose  and  the  column  vectors  uUj)  and 

v(ij)  are  respectively  the  left  and  right  normalized  eigenvectors  of  Q 

associated  with  X,  (see  [Ta60],  e.g.).  So  that 
XI 


sk,*<v  ■  w  w*  isk- iin’ 


g  zk  k  <xt  >al»  1-1 . V 

!t-l  *  I 


and 


1.4 


Moreover  as  Q  Is  stochastic,  (J^l»...,n)  and  v(l^)  is  the 

limiting  probability  vector  lim  x(t). 

t-*» 

It  follows  from  (1.6)  and  from  the  properties  of  idempotency  and 
orthogonality  that 

N  °I 


QC  -  E  S  Z(X.  ). 

1-1  i-1  1I  *1 


(1.7) 


Following  the  classification  of  roots  defined  by  (1.5),  we  may  divide 
the  rlghthand  side  of  (1.7)  into  three  terms: 


t  N  t  »  “l  t 

Q  -  Z(X.  )  +  E  X.  Z(X.  )  +  E  EX.  Z(X.  ). 

*1  1-2  LI  I  1-1  i-2  1 I  *1 


(1.8) 


Q  may  not  be  expanded  directly  as  Q  since  the  N  largest  roots  of 

^  *fc  "it 

Q  are  all  unity.  However,  any  non  decomposable  submatrix  of  Q  may 
be  so  expanded: 

"i 

*t  *  *  i'*t  *  *  . 

Qt  -  Z  (X,  )  +  E  X.  Z  (X.  ). 

I  i-2  I  I 

* 

Bordering  each  matrix  Q  with  the  appropriate  number  of  rows  and 
columns  of  zeros  and  designating  those  nxn  matrices  by  the  same  name,  we  have 


*  N  * 
Q  ■  E  Q 
1-1  1 


*t 

and  thereby  Q  - 


N 

E 

1-1 


N  ”l 


*  *  "  *  *t  *  . 

Z  (X,  )  +  E  E  X.  Z  (X  ). 
LI  1-1.  i-2  XI  lI 


(1.9) 


1.5 


The  dynamic  time  behaviors  of  x(t)  and  x  (t)  are  specified  respectively 
by  (1.8)  and  (1.9).  To  compare  those  behaviors,  Simon  and  Ando  [S161]  have 
proven  the  following  propositions: 

Theorem  1.1.  For  an  arbitrary  positive  real  number  §2»  there 
exists  a  number  e2  auoh  that  for  «  <  «2» 

“  I  *ki  <V  ■  >l  <  h 

K  p  At  1  X 


And 


for  i *2 . Hj.;  I**l , « . « ,  N  £  1  ^  k ,  1  ^  n. 


Theorem  1.2.  For  an  arbitrary  positive  real  number  u>,  there  exists 
a  number  such  that  for  e  <  s^, 


T“ 1  2iijj  aCx '  \<lj)  "“'V  1  < 


U) 


for  K~1 . N;  1-1 . N;  J-1,...,N;  lSiSn^  1  S  j  S  0j; 

-*  4 

where  [v^  (lj)  ]  is  the  right  row  eigenvector  of  Qj  associated  with 
•J 

the  root  of  unity 


-* 


.  VV 


»  !»•••» nj» 


E  v.  (1T) 
j-1  JJ 


and 


where  ar^X.^)  is  given  by 


nI  nJ 


-* 


aTT(X,  )  -  E  E  v.  (1T)  z  (X,  ). 
\  i-1  J=1  ll  1  Vj  V 


1.6 


1.2  Interpretation  of  the  Theorems 

The  Implications  of  these  two  theorems  may  be  discussed  in  more  con¬ 
crete  term*. 

Since  by  (1.5),  the  X.  ,  I-1,...,N  are  almost  unity,  for  any  small 

t  I 

t,t  <  T2,  X,  ,  I"l, • . . ,N  will  stay  close  to  unity.  Thereby,  the  first  two 
I 

terms  of  the  rlghthand  side  of  (1.8)  will  not  vary  very  much  while  the 

first  term  of  the  righthand  side  of  (1.9)  will  not  vary  at  all.  Thus,  for 

* 

t  <  T2»  the  time  behavior  of  x(t)  and  x  (t)  are  defined  by  the  last  terms 
of  (1.8)  and  (1.9),  respectively.  But,  as  c  -»  0,  it  results  from  (1.4) 


Xi  "*  *  i 
*1  I 


and  from  theorem  1.1  that 


Z(X£  )  -*  Z  (\t  ) 
I  I 


for  1-2,...,^  and  I-1,...,N. 


Hence,  for  t  <  T2,  the  time  path  of  x(t)  must  be  very  close  to  that  of  x  (t). 

* 

Now,  since  the  X.  ,  i"2,...,n  ;  I-1,,..,N,  are  less  than  unity  (1.3), 

1 T  i 

1  * 

for  any  positive  real  g^,  we  can  define  Tj  such  that 


N  ni 


max 

k,f 

1  £k,  t  in 


|  E  E  X^  *kjj  (Xj  )  |  <  for  t  >  T^. 
1-1  1-2  I  I  1 


Likewise,  we  can  define  T^  such  that 


N  ni 


I  E  E  if  Lj(L  )  |  <?„  for  t>  I,. 
k,f  1-1  i-2  I  *  I  1  1 

1  i  k,  fin 


Moreover,  theorem  1.1  and  (1.4)  ensure  that 


Tj  -*  Tj  as  e  -»  0, 


I,  being  independent  of  c.  Since  T,  cen  be  ..do  large  ve  w.„t  by 
taking  .  sufficiently  sell,  while  T*  resins  independent  of  t,  we 
take  e  eo  that  T2  ts  very  muoh  larger  than  Tj. 

Finally,  provided  that  «  is  not  identically  tore  so  that,  except  for 

V  V  1-2 . .  “  n0t  1<le”tIcslly  “”ity,  we  cay  define  T3  so  that  fo, 

an  arbitrary  positive  real  number  g  and  t  >  T 

J  3 


k  ,1 

l^k.lSn 


m  \<h- 


owing  to  the  cle.siflc.tion  of  root,  defined  .hove,  I,  i,  greeter  then  T2; 
it  Increases  without  licit  a.  «  -  0.  For  I,  <  t  <  ^  the  lest  s^-etio! 
terc  of  (1.8)  1.  negligihle  end  the  time  path  of  X  is  detercined  by  the  two 

first  succetton  teres,  gut  theoree  1.2  specifies  thet  for  any  I  end  J,  the 

elements  of  Z(\  ) 

1C 

I  J  K  Vj  lK  i*  j  1K' 

depend  only  upon  I,d  and  J,  end  sre  slco.t  independent  of  1.  Thet  is,  for 

any  I  and  Jrtey  ere  proportional  to  the  element,  of  the  characteristic 
vector  of  Qj  associated,  with  the  root  of  unity 

\>1  dj),...,v  <1  ) . v*  (1  ) 

j  Jj  J  nj  j  J 


(1.10) 


1.8 


and  are  approximately  the  same  for  i-l,...,^. 

Thereby,  since  for  T2  <  t  <  QC  is  mainly  determined  by  the  two  first 

terms  of  (1,8),  the  vector  [x  (t)]  will  vary  with  t  during  that  period 
keeping  for  a  given  J  an  approximately  constant  ratio  among  distinct  elements 
which  is  identical  to  that  of  the  elements  of  (1.10).  In  other  words  a  local 
equilibrium  is  reached  within  each  subsystem  J.  We  may  at  this  stage  use 
such  an  equilibrium  probability  distribution  as  (1.10)  to  cluster  all  states 
of  a  subsystem  I,  that  is  to  replace  the  n-dlmenslonal  vector  x(t)  by  an 
N-dlmensional  vector  of  aggregative  variables  x  (t),  I“1,...,N,  rind  the  nxn 
matrix  Q  by  a  NxN  matrix  |  Ujjl  |  • 

Finally,  for  t  >  Tj,  all  terms  of  the  righthand  side  of  (1.8)  except 
the  first  one  become  negligible  and  the  behavior  of  x(t)  will  be  dominated 
by  the  largest  root  of  Q,  as  in  any  linear  dynamic  system. 

We  may  summarise  the  above  discussion  by  saying  that  the  dynamic  be¬ 
havior  of  a  system  representable  by  a  nearly  completely  decomposable  matrix 
may  be  analyzed  in  four  stages  called  respectively  by  Simon  and  Ando: 

(i)  short-run  dynamics,  (ii)  short-run  equilibrium,  (ill)  long-run  dynamics, 
(iv)  long-run  equilibrium.  In  more  precise  terms,  these  stages  are: 

(i)  Short-run  dynamics:  t  <  <  T2;  the  preponderantly  varying 

term  of  (1.8)  is  the  last  one  and  this  term  is  close  to  the 

★ 

last  one  of  (1.9);  x(t)  and  x  (t)  evolve  similarly. 

(ii)  Short-run  equilibrium:  Tj^  <t<T^;  the  last  terms  of  (1.8)  and 
(1.9)  have  vanished  while  the  time  powers  of  the  N  largest 
roots  x!  ,  I"l,. . . ,N,  remain  close  to  unity.  A  similar  equil- 

I  * 

ibrium  is  being  reached  within  each  subsystem  of  x(t)  and  x  (t). 


1.9 


(ill)  Long-run  dynamics:  T2  <  t  <  T3 ;  the  preponderantly  varying 

term  of  (1.8)  is  the  second  one.  The  whole  nearly  completely 
decomposable  system  moves  toward  equilibrium,  an  equilibrium 
among  relative  values  of  variables  within  each  subsystem  being 
approximately  maintained. 

(iv)  Long-run  equilibrium:  t  >  T3;  the  first  term  of  (1.8)  dominates 

all  the  others.  A  global  equilibrium  is  attained  since  in  this 

case  X1  ■  1. 

L1 

We  aonolude  that  in  the  short  run ,  i.e.t  stages  (i)  and  (ii)3  a  system 
whioh  enjoys  the  property  of  near-complete-decomposability  may  be  considered 
as  a  set  of  independent  subsystems  which  may  approximately  be  analyzed  sep¬ 
arately  from  one  another.  In  the  long  run  the  whole  system  appears  to  evolve 
keeping  roughly  the  state  of  equilibrium  within  each  subsystem.  Each  subsystem 
may  be  replaced  by  an  aggregative  variable ,  and  the  whole  system  analyzed  as  a 

set  of  interactions  among  those  variables3  the  interactions  within  each  subsystem 
being  ignored. 

1.3  Fused  States 

In  principle,  the  aggregative  variables  Xj.(t),  1*1, ...,N,  can  be  any 

_*  * 
function  of  the  equilibrium  state  distribution  [vt  (lj)]  of  the  system  Qj. 

provided  that  appropriate  transition  probabilities  q^  may  be  derived  ac¬ 
cordingly.  A  straightforward  approach  which  has  been  discussed  from  another 
point  of  view  in  the  literature  (see  e.g.,  [Ve69]),  is  that  of  "fusing"  the 
states  of  each  subsystem  1.  In  this  case  x^(t)  is  taken  as  the  sum  of  the 
probabilities  of  being  in  any  one  state  ij,  i“l . ^  of  subsystem  Qj  at 


time  t: 


1.10 


I 

x_(t)  -Ex  (t); 
1  1-1 


then,  assuming  the  system  defined  by  (1.1)  to  be  a,-  time  t  In  any  one 
state  1  ,  1-1,..., nj  of  subsystem  I,  the  probability  that  It  will  be  In 
any  one  state  Jj,  J— l,...,nj  of  subsystem  J  at  time  (t+1)  Is  given  by 

,  nl  "j 

■  <xi<e» '  £  x!I(t)  £  \v 


For  T2  <  t  <  T3,  we  may  write 


x  (t) 
I 

Xj(t) 


\  <V 


and  thereby  analyze  the  whole  system  as  a  set  of  subsystems  I-1,...,N 
whose  Interactions  among  subsystems  are  independent  of  time  and  given  by 

I  hull  where 


qTT  "  E  V.  (1T>  £  q*  4  • 

u  1-1  lI  1  J-l  lIJJ 


(1.11) 


Since,  at  this  stage,  only  the  roots  X^  ,  K-1,...,N,  are  to  be  taken 

1C 


Into  account,  we  have 


N 


q  ,  -  E  ^  zi  \  ^1  ^  * 

iIJJ  K^l  K  lIJJ  XK 


(1.12) 


Introducing  (1.12)  into  (1.11)  we  obtain 


"i  nJ 


qu  "  s  h  \  vj  (1T)  ^  (X,  ) 

IJ  K-l  1-1  J-l  iI  1  iI-Jj  lK 


1.11 


which  yields  Che  relation  between  elements  qT.  and  the  constants  aT  .(X,  ) 

I J  IJ  1 

which  were  Introduced  by  theorem  1.2: 


q 


IJ 


N 

£  X.  Of..  (X.  ) . 
K-l  K  U  TC 


Let  x  ,  1*1,..., N,  be  the  long-run  equilibrium  probability  of  being 
at  some  time  t  >  in  subsystem  I  which  Is  obtained  by  studying  the  whole 
system  as  a  set  of  subsystems  whose  Interactions  are  defined  by  (1.11). 

From  those  probabilities  one  can  approximate  the  long-run  equilibrium  prob¬ 
abilities  of  being  in  any  one  elementary  state  1^.  by: 


with 


and 


_* 

^  (Ij), 

I 


n 


I 


E  x  -  x 
1-1  1I 


N 

E  x  —  1)  1-1, ... ,n  ; 

1-1  1  1 


I— 1, . . • ,N. 


(1.13) 


1,4  Multi-level  Systems 

The  system  of  variables  discussed  in  the  foregoing  paragraph  can  be 
represented  as  a  two -level  hierarchy  with  the  aggregative  variables  at  the 
higher  level.  It  is  clear,  and  Simon  and  Ando  do  not  fail  to  observe  it, 
that  such  an  hierarchy  may  be  extended  to  more  than  two  levels,  each  vari¬ 
able  at  a  certain  level  being  an  aggregate  of  variables  of  the  immediately 
lower  level.  An  example  of  a  two  level  nearly  decomposable  matrix  (cor¬ 
responding  to  a  three-level  hierarchy)  would  be 


where  Che  order  of  magnitude  of  Che  elements  of  submatrices  P  is  larger 
than  that  of  matrices  Q,  this  latter  being  larger  than  that  of  matrices  S. 

At  the  first  level  of  aggregation  there  could  be  four  aggregative  variables , 
one  for  each  matrix  P;  at  the  second  level  of  aggregation  two  aggregative 
variables  would  correspond  to  the  partitions  indicated  by  the  dotted  lines. 

More  generally,  we  will  say  that  to  the  (btl)  levels  j^0,...,L  of  a  nearly 
completely  decomposable  hierarchy  may  correspond  L  levels  of  aggregation 
j^l,...,L  and  that  such  a  hierarchy  is  representable  by  an  L -level  nearly 
completely  decomposable  matrix.  In  such  a  system  the  time  instant  . 
beyond  which  the  whole  system  ceases  behaving  as  a  set  of  nearly  independent 
subsystems  at  the  fth  level  of  aggregation  is  also  the  time  instant  Tq  ^ 
at  which  subsystems  of  the  adjacent  upper  level  (ffl)  of  aggregation  start 
moving  towards  their  internal  equilibrium  (which  they  reach  at  time  T, 

1  9  XJT 1 

T2,f 5  T0,jfrfl’  W,...,L-1 

The  time  instant  at  which  a  global  equilibrium  is  reached  throughout 

the  whole  system  is  the  time  instant  T.  at  which  equilibrium  is  reached  among 

1,L 

the  aggregative  variables  in  terms  of  which  the  whole  system  is  described 
at  the  uppermost  level  L.  These  time  relations  among  levels  are  schema¬ 
tized  in  the  following  diagram: 


1.13 


level  L 


level  ( J0+1) 


To.fln  Ti.m 


level  i 


X 

T2,JN-1 


■>  t 


Each  level  l  is  a  level  of  aggregation  provided  there  exists 
small  enough  so  that 


an  e 


l 


T1  A-l  <  ^T2  i-1  =  T0  t)  ^or  ^2,...,!. 

The  analysis  of  the  entire  structure  proceeds  then  from  the  lowest 
level  of  aggregation  through  each  adjacent  level  up  to  the  highest  one:  at 
each  level  of  aggregation  the  system  Is  viewed  as  a  set  of  Independent  sub¬ 
systems  whose  short-run  equilibrium  Is  analyzed  In  terms  of  aggregative 
variables  descriptive  of  the  short-run  equilibria  reached  by  the  subcom¬ 
ponents  which  were  considered  at  the  adjacent  lower  level.  The  equilibrium 
obtained  ultimately  in  this  way  at  the  most  upper  level  defines  in  terms 
of  aggregative  variables  the  long-run  equilibrium  of  the  whole  system. 

From  this  long-run  equilibrium  at  the  uppermost  level  it  is  possible 
to  deduce  the  long-run  equilibrium  probabilities  for  each  lower  level. 

In  the  case  of  "state  fusing"  discussed  above,  we  may  for  instance  general¬ 
ize  the  relation  (1.13). as  follows.  Let  x^  ^  be  the  long-run  equilibrium 
probability  of  being  in  some  aggregate  subsystem  I  of  the  upper  level  L  at 
any  time  t,  t  >  T«.  Then  the  long-run  equilibrium  probability  x^”^  of 
being  in  subsystem  J  of  I  at  the  (L-l)th  level  of  aggregation  is  given  by 


1.14 


x 


(L-l) 

J 


^L-,)  <V- 


(1.14) 


Using  recurrently  this  relation  successively  with  L,  L-l,...,l  and  for  all 
subsystems  at  each  level,  yields  ultimately  the  long  run  equilibrium  prob¬ 
ability  x<°>  of  being  In  the  elementary  state  1,  for  all  1. 


1.5  CoheBlveness 

It  Is  easy  to  show  [F162/2]  that  the  largest  characteristic  root 
X  of  an  njXnj.  non  negative  matrix  Qj.  is  equal  to 


"I 

E  x  S 
1-1  1I1I 


“I 

E  x 
1-1  1 


(1.15) 


“J 

where  S.  designates  the  1th  row  sum  of  Q_  and  {x  *  0],  .  is  the  character- 
1I  I 

istic  vector  associated  with  X.  ,  the  strict  Inequality  holding  true  if  and 

I 

only  If  Qj  is  not  decomposable.  Now,  If  Qj  is  a  principal  sufcmatrix  of  a 
stochastic  matrix  Q,  S.  may  be  interpreted  as  the  probability,  when  In 
state  1  ,  of  remaining  in  the  subsystem  defined  by  Qj  after  the  next  transi¬ 
tion;  moreover,  the  distribution  of  the  probabilities  of  being  in  state 
1  ,  1-1,...,^,  at  time  t  on  condition  to  be  In  Qj  at  time  t-0,  approaches 

the  distribution  of  the  x  as  t  -»  «.  Thus,  the  weighted  average  (1.15) 

1I 

is  the  long-run  probability,  having  been  in  subsystem  Qj  during  the  interval 
[0,t],  of  remaining  in  Qx  at  time  (t+1),  t  -»  ®.  Following  [Fi62/l]  we  will 
refer  to  the  probability  X.  «s  being  the  adhesiveness  of  the  subsystem  Qj. 


one  =.»  deduce  from  (1.15,  that  .  necessary  and  sufflcleK  co„diclon 
for  ,  re  be  nearly  completely  decomposable  Into  the  Indecomposable  matrices 
e  le  thet  the  roots  1^,  I-1.....N  be  sufficiently  close  to  unity.  n,is 
condition  Is  necessary  to  hare  the  two  first  terns  of  (1.8)  suf¬ 
ficiently  close  to  the  first  term  of  (1.9,.  It  m  .  sufficient 
emce  If  the  ^  are  strictly  greater  than  aero.  (1.15,  re,„lres  that  the 
V  J“8I:  “  he  clos‘  c°  ““Ilf.  with  the  result  that 

e  <  1  -  max(S  ) 

1 

will  be  sufficiently  small  to  satisfy  the  conditions  of  theorems  1.1  and 
Now.  the  Frobenlus  theorem  (see  e.g.  [K.64],  p.ge  152,  states  that 

“l  *  \  *  S! 


-ere  st  -  min  s^  .„d  ^  .  „.x  ,  ^  ^  ls  .  ^  ^  ^  ^ 


cohesiveness  of  Qji  .  sufficient  conditio,  of  the  near-complete-decompos- 

eblllty  of  «  1,  thereby  thet  <I  be  sufficiently  close  to  unity  for  all  »„b- 
systems  Qj,  or  that 

f"(sI)  *  (1.16, 

1*1,..., N 


Relation  (1.16,.  merely  expressed  In  ten»s  of  the  entries  of  a  matrix, 
-in  be  used  1„  section  III  as  a  criterion  of  the  near-complete-decompos- 
ability  Of  matrices  defining  the  dynamic  behavior  of  networks  of  queues. 


1.16 


1.6  Remark 

Ando  and  Flacher  [An63]  proved  that  the  Simon-Ando  theorems  could 
be  extended  to  the  case  of  nearly  decomposable  matrices,  defined  by  replac¬ 
ing  the  zeros  in  a  decomposable  matrix  by  small  numbers.  They  showed 
that,  mutatis  mutandis,  the  conclusions  of  paragraph  1.2  remain  true  for 
such  systems:  the  short-run  dynamics  may  be  analyzed  as  if  the  system 
were  decomposable,  thus  Ignoring  the  weak  'feedbacks'  between  subsystems; 
the  long-run  influence  of  these  feedbacks  may  be  analyzed  in  terms  of 
aggregative  variables  representative  of  the  short-run  equilibrium  attained 

by  each  subsystem.  The  subsystems  which  are  aggregated  and  which,  in  the 

* 

Simon-Ando  case,  are  associated  with  the  submatrices  F^.  are  in  this  case 
those  corresponding  to  submatrices 


0  .  .  0 


where  the  G..  are  submatrices  of  dimensions  (n_,  N  -  E  n  ).  Such  systems 

1  J*I  J 

are  referred  to  in  [An63]  as  nearly  decomposable  block  triangular  systems . 


2.1 


II.  A  STOCHASTIC  MODEL  FOR  MULTIPROGRAMMING  SYSTEMS 


He  consider  .  system  which  consists  of  «  set  of  Independent  resources 

VR1 . V  Each  °f  those  resources  (*>  i,  e.p.ble  of  provldlng  .  dlf. 

ferent  type  of  service.  We  define  «  sequential  process  <*>  ln  thls  8ysten| 
«s  being  the  sequential  execution  of  .n  ordered  sequence  of  requests  for 
those  resources,  every  single  request  applying  to  e  single  resource,  every 
resource  completing  one  request  at  a  time. 

Let  the  service  time  of  resource  be  .  rendom  variable  exponentially 
distributed  with  a  mean  value  equal  to  l/p(!  ^  measure,  the  average  number 
of  requests  which  may  be  completed  by  R£  per  time  unit. 

*  sequential  process  is  stochastically  defined  by  a  set  of  transfer 
probabilities  Pmg»  0  S  m  ,  A  s  L;  pmi  is  tie  probability  of  a  request  for 
resource  ^  following  ta.edl.tely  the  completion  of  a  request  for  resource 


Rm  in  the  sarae  8equential  process.  We  have 


/„  Pmf  *  *•  m"° . L. 


The  fact  that  p^  may  be  nonzero  means  that  a  sequential  process  that  has 

just  completed  service  at  resource  may  need  to  be  rcserviced  immediately 
by  the  same  resource. 


In  queuing  theory,  the  terms  server  and  customer  would  be  common. 


2.2 


Let  N,  N  <  <*>,  be  the  total  number  of  sequential  processes  concur¬ 
rently  executed  In  tbe  system,  these  sequential  processes  being  mutually 
Independent  and  identically  defined  by  che  same  probability  set  (pm^}» 

As  resources  may  at  the  most  service  one  request  at  a  time,  sequential  v 
processes  will  eventually  queue  up  for  resources.  Let  i^  be  the  number 
of  sequential  processes  in  service  or  in  queue  at  resource  R^.  Since 
sequential  processes  may  at  most  be  waiting  for  one  resource  at  a  time, 
we  have 

L 

£  i.  -  N.  (2.0) 

jM  1 


N  is  supposed  to  remain  constant  in  our  system.  That  is,  each  process  is 

to  be  considered  as  lasting  for  an  infinite  period  of  time.  This  is  not 

so  important  a  restriction  as  might  appear  at  first  glance.  We  could 

indeed  easily  construct  an  equivalent  system  in  which  the  total  number  of 
L 

processes  £  i.  would  vary  in  time  without  exceeding  N  by  simply  consider- 
ing  a  resource  R^+1  with  service  rate 


^L+l 


X  X  i 


L+l 

L 


(2.01) 


and  where 


*L+1  ^  ^  i  » 

L+1  j6»0  Jfc 


which  would  model  the  creation  of  processes  in  our  system.  The  probability 
of  a  request  to  R.  being  the  last  request  of  a  sequential  process,  i.e., 
the  probability  of  a  sequential  process  "dying"  on  completion  of  a  request 
to  R^,  would  be  P^L+i)i  P(kfi)x  woul<*  be  tbe  probability  that  R^  is  the 
first  resource  requested  by  a  sequential  process. 


2.3 


The  state  of  the  system  Is  uniquely  defined  by  the  (L+l) -tuple 
(1q,...,1l>  and  there  are  df)  distinguishable  such  states,  l.e.,  the 
number  of  partitions  of  N  processes  among  (L+l)  sets. 

Let  P  (ig,...,iL,t)  be  the  joint  probability  that  at  epoch  t  the 
system  Is  In  state  (Iq,...,^).  These  probabilities  satisfy  the  system  of 
difference  equations  of  a  generalised  time -homogeneous  birth  and  death 
process: 

P(l0,i1,...,iL,  t-HO  - 


P( !q  ,  1 1 ,  •  •  •  , p  O 


(1-h  Z  hUj)  ^(l-p^)) 
Jt»0 


+  A  uZo  H<1,n)  P<1° . 1jE+1*-«»1B-1»*«'*1L,t)h*‘i  Pfi 


m 


+  0(h), 


(2.1) 


where  0(h)  is  the  probability,  negligible  for  small  values  of  h,  of  more 
than  one  request  being  completed  during  the  time  interval  (t,t+h],  and 
the  binary  function 


hU,)  -J 


if  lt-  0  , 
if  lt/0. 


accounts  for  the  impossibility  of  any  1^  taking  negative  values. 

This  system  of  difference  equations  may  be  represented  by  the  matrix 
equatlou 

P(t4h)  -  P(t)  (h  A+I)  +  0(h),  (2.2) 

where  P(t)  Is  a  vector  whose  each  element  Is  the  probability  for  the  system 


2.4 


being  at  time  t  In  one  of  the  distinguishable  states.  A  is  a  matrix 

of  constants  called  the  transition  Intensity  matrix  of  the  process,  with 
the  properties  that 

j  " 0 

for  all  values  of  i,  and 

*u10 

for  all  i/j.  I  is  the  matrix  unity. 

Equation  (2.2)  may  be  rewritten  in  the  form 

(P(t4h)  -  P(t))h’1  -  P(t>  A  +  . 

As  h  -*  0,  the  last  term  tenths  to  aero;  hence  the  limit  of  the  lefthand  side 
exists  and 

P'(t)  -  P(t)A 

where  P' (t)  is  the  vector  of  the  time  derivatives  of  the  elements  of  P(t). 
An  equilibrium  probability  distribution 

P  -  lim  P(t> 
t-*» 

is,  by  definition,  a  solution  of  the  system 

P  A  -  0  (2.3) 

satisfying  the  condition  that  the  sum  of  the  elements  of  P  equal  1.  For  a 
birth  and  death  process,  this  limiting  distribution  always  exists  and  is 
Independent  of  the  initial  conditions  P(0). 


2.5 


We  shall  not  pursue  the  Investigation  of  this  equilibrium  distribu¬ 
tion  which  has  been  studied  in  [Ja63]t  [Go67];  we  shall  instead  concentrate 
on  the  structure  of  the  matrix  A  which  defines  the  dynamic  behavior  of  the 
model. 


3.1 


III.  NEAR -COMPLET E -D ECOMPOS AB IL ITY  OF  MULTIPROGRAMMING  STOCHASTIC  MODELS 


We  establish  in  this  section  the  conditions  under  which  the  system 
defined  by  equations  (2.1)  enjoys  the  property  of  near-complete-decompos- 

ability. 

Henceforth  ve  will  neglect  0(h),  take  h-1  and  Q-Atl,  so  th.t  equation 
(2.2)  may  be  rewritten 

p(t+l)  -  P(t)  Q. 

L 

The  time  unit  ia  chosen  small  enough  so  that  *  1. 

Each  index  value  J.  J-l . <^f)  «t  the  atochaatlc 

matrix  Q  refers  to  a  distinct  state  (1q»1j . of  the  ■y8*'601*  We 

choose  an  arrangement  of  the  rows  and  columns  of  Q  such  that  the  (L+l) -tuple 

i* . iT  which  is  associated  with  the  index  value  j,  yields  for  the 

0  1  L 

function 

f(x)  -  iQ  +  ixx  +  i2x2  +’**+  1LX 


a  value  fj(*)  such  that 


fj(x)  >  f^U)  for  >l  and  Vx>  N> 


An  example  of  a  matrix  whose  rows  and  columns  are  arranged  in  this  order 
is  given  in  figure  3.1. 

It  result,  from  equations  (2.1)  that  any  non  diagonal  entry  qj([  of  Q, 
j/k.  1.  non  aero  and  equal  to  H  *>.  if  «lT  “lstB  * 


pair  (A,m)  so  that 


km 

kp 


-  1 

for  Vp^Jt,m, 


(3.1) 


3.2 


where  i.  .  Is  the  value  of  the  coefficient  of  x  In  f  (x). 

J*  j 

It  results  also  from  (2.1)  that  any  diagonal  entry  q^,  j  referring 

to  some  state  (1-,. . . ,1  . . ,1  is  given  by 

u  I,  m  l 


'  1  •  ■  1  '  Jo  Jo  ^  ‘  1  '  Jo  K(I‘)^(1-p«) 


m/i 


(3.2) 


Denoting  Q(N,L),  N  >  0,  L  >  1,  the  matrix  of  order  defined  by 

equations  (2.0), (2.1)  and  this  numbering  of  states,  we  can  prove  the  following 


Lerrma  3.1.  The  entries  of  the  stochastic  matrix  Q(N,L),  N>0,  I>1, 

may  be  partitioned  among  (Nfrl)  principal  submatrices  Q'(i  ), 

Li 

iL  -  0,...,N  so  that 

(i)  the  set  of  all  non  diagonal  entries  of  each  matrix 

Q'(iL),  iL“n,  is  the  set  of  all  transition  probabilities  between 
any  two  distinct  states  of  the  states  with  iL»n; 

(ii)  Any  non  zero  entry  outside  the  main  diagonal  of 

Q(N,L)  takes  only  the  value  V1"*  with  4m<L  if  it  is 

located  within  one  of  these  submatrices ,  and  with  max(j&,m)  »  L 
if  it  is  located  outside  these  submatrices. 


Proof.  1)  Owing  to  the  numbering  of  system  states  according  to  increasing 
values  of  f(x),  the  smaller  index  values  of  Q(N,L)  refer  to  all 
states  with  ij"0;  there  are  Cjr)  such  states,  i.e.,  the 
number  of  distinct  integer  solutions  of  equation 


L-l 

2  i  .  -  N. 
jfr>0  1 


3.3 


(L~fN  -2\ 

j  following  index  values  refer  to  all 
states  with  iL  -  1,  and  so  on  up  to  iL  -  N.  All  states  may 
therefore  be  partitioned  among  (N+l)  sets,  i  -  0,...,N,  the 
(iL+l)th  grouping  all  the  (L"^'1l)  states  for  which 


L-l 


E 

j6"0 


N-i. 


9 


in  agreement  with  the  Identity 


N 


E 


L-HN-1,.)  . 


These  (N+l)  sets  define  as  many  principal  submatrices  denoted 
Q'Uj.)  all  of  whose  non  diagonal  entries  are  the  transition 
probabilities  between  any  two  states  of  the  set  of  all  states 
for  which  i  has  a  given  value. 


2)  The  second  proposition  of  the  lemma  follows  immediately.  Any 
off  diagonal  entry  qJk(N,L),  j^k,  of  Q(N,L)  is  non  zero  and 
equal  to  0  £  l,  ra  £  L,  provided  there  exist  a  pair  (f,m), 

tfa,  which  verifies  equalities  (3,1), 

For  any  entry  qJk(N,L),  j^k,  located  within  a  submatrix 
Q'(iL).  ™  ^kL  =  thereby  necessary  that 

(VL)  «nd  (m^L) 


for  qjk<N,L)  to  be  non  zero. 

Alternatively,  for  any  entry  qjR(N,L)  j^k,  located  outside 
all  the  principal  matrices  Q’(iL),  iL-0,...,N,  by  definition 

1jL  ^  W  that  ifl 

( fpL)  or  (maL), 


which  completes  the  proof. 


In  order  to  help  visualize  the  structure  of  matrix  Q(N,L) ,  Q(2,3) 

is  displayed  in  figure  3.1.  Plain  lines  isolate  submatrices  Q' (0),  Q'(l), 

Q'(2);  E  denotes  the  ith  rowsum  of  non  diagonal  entries, 
i 

Using  the  preceding  lemma,  we  may  now  prove  the  following  proposition: 
Lerrna  3.2.  Ift 


r  L  L-l  L-l 

min  1  -  E  h(L  )p.  +  E  H<i.  )*i.  (  E 
ig*****^  l  k*0  k"0  m»0 


3  km  pkL^ 


S  i.  -  N 
k-0  K 


-  K<1I>L<^  -  »U.> 


ll  »  0,  (3. 


3) 


then  the  sparse  stoohastia  matrix  Q(N,L),N  >  0,  L  >  1,  defines  a  system 
nearly  completely  decomposable  into  (N+l)  systems  which  may  he  represented 
hy  stochastic  matrices  Q(N-n,L-l),  n«0 . N. 


Proof. 


with 


(1)  Lemma  3.1  allows  to  write: 

Q(N,L)  -  Q*(N,L)  +  eL  C(N,L) 


Q'(0) 


Q*(n,l)  - 


Q'(l) 


Q'(N) 


(3.4) 


•ff 

where  all  entries  of  Q  (N,L)  not  displayed  are  zero,  non  diagonal  entries 
* 

of  Q  (N,L)  have  only  values  k/m,  with  k,m  <  L,  and  non  diagonal 

entries  of  C(N,L)  have  only  values  {Oje’VjP^ie’1^?^}  with  m  <  L. 

On  the  other  hand,  the  jth  rowsum  of  Q(N,L),  say  Sj,  j  referring  to 
some  state  (iQ,...,i^,...,i^ . ^L-1*^L^*  ra*^  exPresse^  as 


3.5 


L-l  L-l 

q  +  S  E  H<ik)»AkPkin 
JJ  k-0  m-0  K 

na^k 


+  £  ’‘‘VVkL  +  K(V  J0  Vto-  (3'5) 


It  results  from  lenma  3.1  that  the  sum  of  the  two  first  terms  of  Sj, 

is  the  sum  Sj  of  the  entries  of  the  corresponding  row  in  Q'(iL>,  ij-0 . N. 

We  know  from  £  1.5  that  a  sufficient  condition  for  Q(N,L)  to  be  nearly 

completely  decomposable  into  Q'(0) . Q'(N),  is  that  the  lower  bound  of 

the  ooheeivene88  of  any  of  these  submatrices  be  sufficiently  close  to  unity; 
or  that  (cfr.  inequality  1.16): 

min(  S  J) »  1  -  min(S!)  . 

J  J  J  J 

Since  Sy-1,  this  condition  is  equivalent  to 

L-l  L-l  "I  L-l  L-l 

min  ,  +  E  E  *<  VVta  »  =  K<ik)'*kpM. +  "‘VJ.  Vto' 

1  JJ  k"0  m**0  k"0  m**u 

L  m/k 

Replacing  q  by  its  value  given  by  relation  3.2,  yields  inequality  (3.3). 

Inequality  (3.3)  expresses  the  condition  that  the  probability  of  an 
individual  sequential  process  moving  between  the  subsystem  of  resources 
Rq» ... ,R^  j  and  the  subsystem  R^  is  small  relative  to  the  minimum  prob¬ 
ability  of  his  remaining  in  the  subsystem  or  of  no  sequential  process 

moving  at  all. 

(2)  All  states  referred  by  the  index  values  of  a  matrix  Q'(n), 

n-0,...,N,  are  arranged  according  to  increasing  values  taken  by  the  function 
il  T  L_1 

(i„+...+i  ,x  +  nx^)  with  E  i,  -  N-n.  Hence,  it  results  from  proposition 
0  L-l  0  1 

(i)  of  lemma  3.1  that  each  non  diagonal  entry  of  Q'(n)  is  Identical  to  the 


3.6 


corresponding  entry  of  a  matrix  Q(N-n,L-l).  Q'(n)  is  however  not  stochastic; 

its  jth  rowsum  S'  is  equal  to  the  two  first  terms  of  S.  in  (3.5)  and  is 
j  j 

therefore  strictly  smaller  than  unity.  But  the  stronger  the  inequality 
(3.3),  the  more  negligible  will  be  the  two  last  terms  of  (3.5)  relative  to 
the  two  first  ones,  so  that 

Sj  -  Sj  with  Sj  -  1, 

and  Q'(n)  -  Q(N-n,L-l),  n-0 . N, 

which  completes  the  proof. 

Applying  recurrently  lemma  3.2  on  the  matrices  Q(n,jl),  n  >  0,  A  >  1, 
produced  by  the  recurrence  matrix  relation  (3.4),  one  obtains  the  following. 

Theorem  3.1:  The  system  defined  by  matrix  Q(N,L) ,  N  >  0,  L  >  1,  i8  (L-l)- 
level  nearly  completely  decomposable  and  may  be  represented  at  each  level 
°f  aggregation  At  A-L-l, . . . ,1,  by  a  set  of  (N+l)  aggregation  variables3 
each  one  defining  the  short-run  equilibrium  of  a  system  Q(n^, l) ,  n^»0,...,N 
if*  for  jHL-1,... ,1: 


r  JM-1  A  I 

Ll  ■  Jo  "(V*  +  J„  ^  -  *<«>> 


m=»0 


(3.5.1) 


A¥l 


IS  r  i.  SN 
k=0  k 


'  P  -  »(Wi)<LH] 


A  sufficient  condition  for  near<*complete  -decomposability  which  is 
stronger  than  the  above  condition  but  also  easier  to  verify  is  given  by 

Corollary  3.1:  A  sufficient  condition  _forQ(N,L)  to  be  (L-l )-level  nearly 
completely  decomposable  into  systems  Q(n^,i),  n^-O,...^;  j&»L-l,...,l,  i8, 


»0. 


3.7 


for 


C^(Jn  Pk”  ‘  pk(«-l))]  » 


min 
O^k^i,  "  m-0 


'‘w'Jo  P(«-1)" 


This  condition  is  obtained  by  ignoring  in  (3.5.1)  the  term 
which  is  always  non-negative. 


P(JH-1)(JM-1))*  (3*6) 

A+l 

[1  -  E  H(ik)^.] 
k-0 


Remarks.  (1)  The  inequalities  specified  in  theorem  3.1  are  sufficient  but 
not  necessary  conditions  for  near-complete-decomposability  since  they  refer 
to  the  lower  bound  of  the  subsystems'  cohesiveness.  On  the  other  hand, 
let  us  recall  that  theorem  1.1  guarantees  that  whatever  standard  of  approxi¬ 
mation  is  required,  a  degree  of  inequality  exists  Jhich  is  sufficient  to 
produce  results  satisfying  that  standard. 


(2)  We  will  be  more  particularly  concerned  in  the  sections  VI 
and  VII  with  the  special  multi-queue  model  in  which  Vk:  p  =  0;  for  k^O, 

KK 

pkQ  *  1>  and  £or  k  andm^O,  p^-0.  This  model  is  referred  to  in  [Bu71] 
as  the  central  server  model.  In  this  case  conditions  (3.6)  reduce  to: 


for  XfL-1,. . .  ,1:  min  (jjl  )»  ^ 

(KksjJ  k  ** 


and 


l 

^°(ra=l  P()ra  "  P°(^fl))  ^  ^1* 


(3.7.1) 


(3.7.2) 


on  the  other  hand,  it  is  easy  to  verify  that  a  sufficient  (but 


3.8 


not  necessary)  condition  for  (3.7.2)  to  hold,  is  pQ1  »  pn?  »...»  Pnr . 


■02 


■0L' 


(3)  Each  submatrix  Q'(iL),  iL"l,...,N  may  itself  be  partitioned 
into  principal  submatrices  Q' (^.i^),  0  s:  i^  £  N-i^,  whose  set  of  non 
diagonal  entries  is  the  set  of  all  transition  probabilities  between  any  two 


distinct  states  of  the  set  of  all  states  which  have  the  same  pair  (i  ,i  ). 

L-l  L 


Following  this  scheme  recurrently,  Q(N,L)  may  eventually  be  decomposed  into 

principal  submatrices  Q'  . ij).  In  figure  3.1  for  example,  dotted 

lines  isolate  submatrices  Q'(0,0),  Q*(0,2),  Q'(1,H),  Q' (1,1),  Q' (2,0).  Let 


XjM-l,n  be  the  number  of  submatrices  Q'(i i.  .),  j&»0, . . .  ,L-2;  x  , 

u  j&fl,n 


obeys  the  recurrence  relation 


XJ&fl,n  "  x£,n  +  xf+l,n-l 


where  n 


E  ^-m  and  with  the  bounda*y  conditions  x  -  jM-1,  x,  -  1. 

m=0  .  »  1  l,n 


1  w  /£+n\  p 

Therefore  =  V  jj  /  and  tbe  total  number  of  submatrices  Q'(i  ,...,i 

L  L- 

is  given  by 


L-jT 


J0  l,n  \  N  )’ 


Using  identity  (12.16)  of  [Fe68]  page  65  one  may  verify  that  the  sum 
of  the  orders  of  matrices  Q' (I^*****!^^)  amounts  to  the  order  of  Q(N,L) 


or  that 


i  (T)  c-^r)  -  (r)  ■ 


n=0 


(4)  One  cen  observe  in  figure  3.1  that  by  the  mere  virtue 
of  the  resource  service  rate  inequalities,  Q(N,L)  is  a  nearly  decomposable 
block  triangular  matrix  if  all  transfer  probabilities  have  approximately 


the  same  value. 


IV.  A  HIERARCHY  OF  AGGREGATE  RESOURCES 


We  assume  henceforth  that  the  stochastic  multiprog ranging  model  de¬ 
fined  by  matrix  Q(N,L) ,  N  >  0,  L  >  1,  is  a  (L-l)  level  nearly-completely-decomposable 
system.  This  amounts  to  postulating  that  there  exists  a  numbering  of  the 
resources  R^Rj,...^,...,^  so  that  the  service  rates  ^  and  the  sto¬ 
chastic  behavior  {p^},  OSi.js  L,  of  the  sequential  processes  obey 
the  (L-l)  inequalities  of  theorem  3.1. 

These  inequalities  imply  that  L  subsystems  s,  s-l,...,L,  consisting 

of  the  (s+1)  resources  Rfl . Rfl  may  be  distinguished  so  that,  on  the  average,  the 

rates  at  which  resources  of  a  same  subsystem  interact  upon  each  other  are  higher 
than  rates  of  interaction  between  any  resource  within  that  subsystem  and 
any  resource  outside  it.  In  other  words,  although  the  values  of  variables 

*0 . la  deflnln«  ^e  state  of  the  sequential  process  population  in  any  subsystem  s 

depend  on  the  past  values  of  variables  i^ . iL  outside  this  subsystem, 

these  dependencies  are  weak  compared  to  intra-subsystem  dependencies. 

In  this  case  the  first  Ando-Simon  theorem  (theor.  1.1)  guarantees  the 
following:  provided  that  inter-subsystem  dependencies  are  weak  compared 
to  intra-subsystem  dependencies,  the  former  may  be  neglected  when  carrying 
out  the  analysis  of  each  subsystem;  the  results  of  this  analysis  will  re¬ 
main  approximately  valid  in  the  short  run ;  the  weaker  inter-subsystems 
influences  are,  the  better  the  degree  of  approximation.  .  Now,  as  pointed 
out  by  M.  Fisher  and  A.  Ando  [Fi62/l]:”. . .  if  this  were  all,  it  would  be 
useful  but  not  very  remarkable  for  it  would  merely  mean  that  if  neglected 
influences  are  sufficiently  weak  they  take  a  long  time  to  matter  much;...". 


But  the  second  Ando-Simon  theorem  asserts  that,  even  after  a  time  period 
long  enough  for  inter-subsystem  influences  to  make  themselves  felt,  the 
relative  values  of  variables  within  each  subsystem  will  remain  approxi¬ 
mately  the  same  as  those  yielded  by  the  short-run  analysis.  This  justi¬ 
fies  an  analysis  of  the  long-run  behavior  of  the  system  in  its  entirety 
carried  out  strictly  in  terms  of  aggregative  variables  representative  of 
these  intra-subsystem  short-run  equilibria  and  in  terns  of  inter-subsystem 
dependencies.  On  the  basis  of  those  general  concepts,  the  recurrent  de¬ 
composition  scheme  defined  by  theorem  3.1  enables  a  hierarchical  model  of 
the  multiprogramming  system  defined  in  section  2  to  be  set  up.  This  model 
is  approximately  equivalent  to  Q(N,L) ,  the  degree  of  approximation  de¬ 
pending  on  the  relative  degree  of  weakness  of  inter-subsystem  dependencies. 
We  now  proceed  to  describe  more  precisely  this  "nearly  equivalent"  hier¬ 
archical  model,  which  will  turn  out  to  be  a  generalization  of  the  model 
described  in  [Cg70]. 

4.1  Decomposition  into  Levels 

Any  matrix  Q(n^,  i) ,  1  5  n^  S  K,  1  s  ^  L,  defines  a  system  in  which 

njfc  sequential  processes  compete  for  the  (Afl)  resources  . R^. 

Let  us  associate  with  each  such  matrix  a  queueing  system  which  will  be 
denoted  9Jt^(n^)  and  which  may  be  schematically  represented  as  follows: 


4.3 


A  fixed  and  finite  population  of  n^  sequential  processes  is  considered 
to  be  cycling  in  this  system.  These  sequential  processes  request  alterna¬ 
tively  two  resources:  resource  R^  and  what  we  call  an  aggregate  resource 
which  will  appear  to  be  a  synthesis  of  resources  R^, . . . ,R^_^.  We  say  that 
the  system  IR^(n^)  is  in  state  E^(n^_jJ n/}  whenever  n^_^  among  n^  sequential 
processes  are  either  waiting  for  or  being  executed  by  that  aggregate  re¬ 
source,  i.e.,  whenever  n^  ^  among  n^  sequential  processes  are  either  wait¬ 
ing  for  or  being  executed  by  any  one  of  the  resources  R^ . Rq,  and 

(njfc  ”  njg_i^  sequential  processes  are  either  waiting  for  or  being  executed 
by  the  resource  R^. 

Assuming  Q(N,L)  to  be  a  (L-l)  level  nearly-completely-decomposable  system,  we  may 
analyze  the  time  behavior  of  SR^(n^)  in  the  following  way.  We  regard  IRj(n^) 


4.4 


as  a  closed  system  that  no  sequential  process  may  leave  or  enter.  In 
other  words,  we  disregard  all  interactions  between  ayn^)  on  the  one  hand 

and  R^fl**,,»RL  on  the  other  handi  we  wil1  see  hereafter  that  the  rates 
of  those  interactions  are  sufficiently  low  to  be  neglected  in  the  analysis 
of  the  time  behavior  of  ^(n^).  We  further  suppose  that  all  conditions  are 
fulfilled  for  the  existence  of  a  probabilistic  equilibrium  in®.(nj.  This 
me...  chat  It  may  be  taken  for  granted  that  each  state  n  An ,  ,-0,...,n 

may  be  reached  from  any  other  state  Ej[(J  |n,),  within  a  finite  time 

period  (irreducibility  and  non-null  recurrence). 


These  conditions  are  neces¬ 
sary  for  the  stochastic  matrix  QCn^.X)  to  have  a  unique  root  equal  to  unity 

and  will  in  fact  be  fulfilled  in  most  cases  when  resources  have  a  non-zero 
and  finite  service  rate  since  the  population  of  5DI  (n  )  is  finite.  3J»(n  ) 

"A*  It  ft 

thereby  enjoys  the  property  of  ergodicity:  there  exists  a  stationary  dis¬ 
tribution  of  the  probabilities  VvJV  of  finding  Sty^)  in  state 

Vnl-llnl^»ni-l"0»**'»ni»  whlch  18  independent  of  time  and  of  the  initial 
conditions  of  SORj(n^). 

An  immediate  consequence  of  theorem  3.1  is  that  this  probabilistic 
equilibrium  of  Syy)  which  is  defined  by  QCn^f)  will  establish  itself  by 
some  time  (see  section  1.4),  that  is,  before  some  time 

T0,jfrfl»  when  resources  have  had  time  to  influence  9)1,  (n.) . 

Interactions  between  those  resources  and  B,^)  may  therefore  be  Ignored 
when  studying^ the  evolution  of  B,(n()  toward.  It.  probabilistic  equilibrium 

tVVlh»n*  .-0- 

X"  J. 


4.5 


i 

I 

i 


Similarly,  just  as  Q(n^, l)  is  itself  decomposable  into  subsystems 

Q(n£  1,X-l),njl  1-0,...tnJ&  ,  ^(n^)  is  nearly  decomposable  into  subsystems 

331  (n  ).  By  definition,  these  subsystems  have  attained  a  state  of  in- 
ti- 1  Jt-1 

ternal  equilibrium  ltTJj_^nJj_2lnjt-l^n^  "0  at  80016  time  Tl,Jt-l*  before 

X-2 

time  T  when  TO.(na)  starts  moving  towards  equilibrium;  those  internal 

0 ,  i  ll  Xt 

equilibria  are  maintained  during  the  period  from  to  when  ^(n^) 

settles  down  to  equilibrium.  In  the  study  of  the  time  behavior  of  TO£(nA) 
during  that  period,  we  may  therefore  assume  that  the  aggregate  resource  of 

TO  (n  ),  which  by  virtue  of  our  decomposition  scheme  is  nothing  but 

X  X 

when  TO.(n  )  is  in  state  E^n^J  n^) ,  ia  already  in  a  state  of  equilibrium, 
conditioned  on  n,  .  and  defined  by  the  distribution  (n j^_2 1  njj-l^n  *0* 

More  precisely,  let  cr^n^)  be  the  mean  number  of  requests  completed 
per  time  unit  inTO^(n^);  this  number  is  the  sum  of  the  mean  number  of  re¬ 
quests  completed  by  and  the  mean  number  of  requests  completed  by  th« 
aggregate  resource  per  time  unit.  As  our  decomposition  scheme  implies  that 
the  aggregate  resource  of  TO^n^)  is  whenever  ^(n^)  is  in  state 

EJj(nJj_il  Ojj) ,  we  have 


njt 

«,<»,>  -  >V1-VntK)>  +  2 ,  TTt<"*-i|"«)a*-i(Vi)- 

nJt-l" 


(4.1) 


( J&*2 , . . . , L)  ,(n  “1,...  ,N) 


For  J&»1,  (4.1)  reduces  to 

0^1  ( n i )  ■  ^1(l-,n1(n1ln1))  +  ^d-i(0|ni)),  (ni"l,»..,N). 


(4.2) 


4.6 


We  ill  refer  to  as  being  the  service  rate  of  SJl^n^) ;  clearly, 

requests  to  RqjRj . R^  are  completed  in  Ul^n^) ,  when  IDl^n^)  is  in 

equilibrium,  at  a  rate  of  cr^n^)  requests  per  time  unit;  a^n^)  being  de¬ 
duced  from  the  rates  p>o*^i* •  •  •  ,M>jg  a°d  the  equilibrium  distributions 

,-0>  £or  V.*1— *V  “”d  . *-»• 

Jo— m — l 

Remark.  Relations  (4.1)  and  (4.2)  seem  to  imply  that  the  constituents 
of  »  Rjj  an^  the  aggregate  resource^  work  independently  of  each  other, 


while  in  reality  these  resources  interfere  with  each  othe~.  In  fact  ^(n^) 


is  an  aggregative  variable  describing  only  the  equilibrium  of  2J^(n^)  In  which 


these  interferences  have  reached  a  steady  state  defined  by  the  distribution 


K(V  iMn,  -0* 

l- 1 


Let  [h  x  if  ^  +  ®(**)]  designate  the  probability  that  during  any 

interval  (t,t+h],  t  >  a  request  is  completed  by  2Rj(n^)  and  the  sequen¬ 
tial  process  being  serviced  applies  to  resource  R_  ,  k  >  l.  As  cr4(n .) ,  .  .(n.) 

K  Xi  At  JUfK  Ju 

obeys  a  recurrence  relation: 


*t,k ‘"t1 '  >*i(1-VnJni))',ik  +  E.)VvilV*i-i.k<n*.i)’  <4-3) 

II  .  4  A 


x-l 


(k  >  l) , (X»2, . . .  ,L-l)/n^»l,. . .  ,N) ; 


For  J&»1 ,  (4.3)  reduces  to 


♦l,k<nl}  a  M.xPlk  (l-n1(n1|n1))  +  ^p^d-TTj^lnj)),  (k=2 . L)  (n^®! . N)  (4.4) 


&  ick^MuiA i 
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A  (n ,)  will  be  referred  to  as  the  interaction  rate  of  2JL(nt)  upon 

"  Xt  Xf 

resource  R^,  k  >  l. 

n 

Probabilities  {TT^(ng|  n^ )  -(j,n0-l, ...  ,N  appearing  in  (4.2)  and  (4.4) 

may  easily  be  expressed  as  functions  of  n^  and  n^.  They  define  the  equilib¬ 
rium  state  of  SJl^np  whose  constituents  are  merely  Rq  and  R^.  The  probability 
that  a  state  transition  E^n^ln^)  -*  E^  (Hq+1|  n^)  ,ng*0, . . .  ,n^-l,  occurs  during 
a  time  interval  (t,t+h],  t<  Tq  2»  is  equal  to  (M^P^h  +  0(h))  where  0(h) 
is  negligible  for  small  values  of  h.  Likewise,  the  probability  of  a  transi¬ 
tion  E^n^ln^)  -»  E^(nQ-l|n^),  ngal,...,n^,  during  a  same  time  interval  is 
equal  to  (m>qPq^^  +  0(h)).  In  such  a  system,  equilibrium  equations  reduce 
to  (see  e.g.,  [Fe68],  pp.  460  ff.): 


^lp10  wl<°lnl>  "  ^0P01  ^l^1! nl> » 

<^lp10  +  M-oPoi^l^oK5  “  ^1P10  1Tl<n0"1lnl)  +  >*0P01  TTl<n0+1lnl)» 


P'0P01  ^1  ^nl  I  nl^  "  ^lPlO  TTl(nl'"1 1  • 


nQ*l , • . • ,nj-l. 


With  the  additional  condition  that  the  TT^(n^|n^),  n^a0,...,n^,  add 
to  unity,  these  relations  yield 


with  TT1(0|n1) 


n0°  \0P01/  / 


(4.6) 


Introducing  (4.5)  and  (4.6)  into  (4.4)  yields  the  values  of 

^1  k^nl^,nl=* . ^"2 . **,  8iven  the  service  rates  of  R^  and  R^  and 

the  transfer  probabilities  (Pok^o’^lk^O* 


4.8 


Using  the  values  obtained  for  ^(r^),  n^l,...,**,  one  may  easily 
obtain  the  distributions  £tt2 (n_  |  n2 )  j”2  of  the  equilibria  which  will  be 

nr° 

attained  by  systems  3^(n2),  n2-l,...N,  at  some  time  T.  .. 

lv2 

The  probability  of  .  state  tranaltlon  EjlnJoj)  -  E^-l^),  ^ . l, 

occurring  InU^)  during  nny  tine  Interval  (t.tHi],  t  <  T  la  equal  to 

093 

(*l,2*nl)h  +  °(h))  while  the  probability  of  a  transition  E^njn^  -  £2(0^1)112), 

nj-0,..., r^-l,  is  equal  to  C^^o^l3*4  +  °<h>]»  The  equilibrium  equations 
are  therefore: 

C*l,2<nl)  +  **2<p20+p21)3  TT2(nlln2)  “  TT2(nl"1ln2>  ^O^l3 3 

+  n2(n1+l|n2) 

•  •  •  ^2  —1)  9 

tM2<P2o*21)3  "  TT2(1ln2)  *1,2(1)» 

♦l,2<n2).TT2<n2ln2>  "  TT2(n2-1ln2)^2(p20'fl,21)]* 

The  additional  condition  that  probabilities  £tt (n  | n,)  j"2  add  to 

»  A  4 

unity  yields 


TVnlln23  -  _ TT2(0|n2),  nj-l,...,^ 

n  *,  ,(k) 

k*l 


with  TT2(0|n2) 


^2  ^20^2 13 


n,  1-1 


n  ♦,  ,(k) 

t-1  1,Z 


(4.7.1) 


‘if  — 'h-  • 


Introducing  these  probabilities  into  (4.3)  yields  the  values  of  the 
interaction  rates  ^  k  "  ■*»•••»!■»  8iven  the  values  of 

the  rates  ^  k(*‘1),  nfl,...,!^.  Proceeding  recurrently  in  this  way  up 
to  level  4,  the  probabilistic  equilibrium  of  a  system  SR^np,  n^lM..,N, 
is  obtained  as 


4-1  n 


4-1 


(|if  S  P «.) 

TT4(n4_l I n4)—  - ~ -  TT4(0ln4) 

n4-l 

n  ,  .<k) 

k-1  1 


TTJR<°ln4) 


/i  +  r 
v 


n4-l  1-1 


vr1  n4-i 


n  *4-1,4 

k»l  ’ 


(k) 


4*2,.. .  ,L;  n-1 . N. 


(4.7.2) 


Starting  with  the  systems  of  the  bottom  level  4f1,  the  probability  dis¬ 
tributions  frtjfaf.Jn mQ,  for  n^>lv...,Mv  are  obtained  from  (4.7.1)  and 

4-1 

(4.7.2)  successively  for  each  level  up  to  the  uppermost  one.  At  each  level 
4  the  values  of  the  interaction  rates  k  *  4+1,..., L  necessary  for 

the  analysis  of  the  levels  above,  must  be  deduced  by  (4.3)  from  the  values 

of  *4-l,k(n4-l)* 


4.2  Inter-level  Relationship 

So  far,  each  subsystem  JRjj(n^) ,  ^1,...,L-1,  has  been  analyzed  as  a 
closed  system  inaccessible  to  entry  or  exit  from  the  upper  levels.  ror 
this  reason  the  equilibrium  distributions  obtained  are  conditioned  on  a 


4.10 


fixed  number  of  sequential  processes  supposed  to  be  permanently  cycling 
inW^(n^).  These  conditional  Internal  equilibria  are  what  Simon  and  Ando 
call  the  short  run  equilibria  which  are  successively  attained  at  each  level 
of  aggregation.  We  have  seen  In  section  1.4  of  Chapter  I  how 
the  long-run  equilibrium  probabilities  for  each  lower  level  of  aggregation 
could  be  deduced  from  the  long-run  equilibrium  at  the  uppermost  level.  If 
>  (njj_i=0»  •  •  •  »*0  >  ( 1 » . . .  ,L) ,  is  the  long-run  and  unconditional 
probability  of  n^  ^  sequential  processes  being  in  service  or  in  queue  at 
any  one  resource  R^,  m=0, . . . ,  jfc-1,  and  s^i^),  i  N  is  the  long-run 

and  unconditional  probability  of  i^  sequential  processes  being  in  service 
or  in  queue  at  resource  R^,  the  relations  (1.14)  yield: 

®l/nL-l^  TTl/nL-J *  SL^L^  *  TTL^N-iLlN^»^nL_i»ixJ=0>,,,»N^» 


and  for  each  level  J&=L-1 . 1: 


VVl* 


N 
E 

Jfnj&-1 


'WWvilV' 


(4.8) 


N 

°j6  ljE 

At  level  1,  whose  only  constituents  are  resources  R^  and  R^,  we  have 

sq(*q)  -  a1^0^ * 


These  relations  express  simply  that  the  probability  of  having  a  popula¬ 
tion  of  n^  sequential  processes  cycling  at  level  l  is  equal  to  the  probability 
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of  being  in  one  of  the  states  E^^nJ n^) ,  n^-n^...^  at  level  (jM-1). 

The  long-run  fraction  of  time  a  resource  R^,  jfr«0,...,L  is  busy  is 
equal  to  (l-s^O) ).  The  mean  response  time  of  resource  R^,  defined 
as  being  the  mean  time  spent  by  a  sequential  process  in  queue  or  the 
service  at  resource  R^  may  be  easily  deduced  using  Little's  formula  [Li61 ] : 


N 


WA  -  [|iA(l-sA(0))]  E^  i^s^i^),  Jt-0,...,L. 


(4.9) 


Moreover,  if  the  uppermost  level  resource  R^  is  the  model  of  an  input 

mechanism  of  sequential  processes  whose: input  rate  obeys  equation  (2.01), 

one  may  deduce  from  the  probabilities  [a  (n.  .))N  n,  using  Little's 

/  L  L_i  "l-1 

formula,  the  mean  life  time  of  a  sequential  process: 


N 


w=  [pL(l-sL(0))j  E  VlaL<nL-l); 

"l-i"1 


(4.10) 


W  is  the  mean  time  spent  by  a  sequential  process  in  queue  or  in  service  at 
the  aggregate  resource  of  level  (L-l).  We  will  refer  to  W  as  being  the 
mean  response  time  of  this  aggregate  of  resources  RQ,... ,R^  y 

4.3  Concluding  Remark 

Near-Complete-Decomposability  allows  the  time  analysis  o£  a  system  to  be 
broken  up  into  distinct  stages  at  each  of  which  only  a  subspace  of  the  system  state 
space  needs  to  be  taken  into  consideration.  Thanks  to  this  property  we 
were  able  to  substitute  the  closed  multiqueues  system  Q(N,L)  whose  state 
space  is  of  size  (rL)  ,  with  L  x  N  subsystems  *01^(0^)  ,(j&=l,...  ,L;  n^-1 , . . .  ,N)  , 


each  having  no  more  than  (n^+1)  distinct  states.  Each  of  these  subsystems 

is  an  equivalent  representation  of  the  x.  „  subsystems,  x 
/ a  i \  A,N-n#  l,N-n, 


(jfc-l+N-n 

.  Jt-i  T 


defined  by  the  x^  N_n  principal  submatrlces 


1-1 

E 

m“0 


Q,(iL,...,iL.jM.1)  of  Q(N,L) ,  with  E^  (cfr.  remark  3  at  the  end  of 


section  3),  An  interesting  consequence  of  this  state  space  partitioning 
is  that  larger  state  spaces  of  more  refined  models  than  the  one  in  section 
2  may  be  analyzed.  Examples  are  given  in  the  following  section. 


5.1 


V.  CLOSED  MULTI-QUEUES  SYSTEM  ANALYSIS 


In  this  section  we  discuss  some  of  the  advantages  of  the  hierarchical 


approach  of  resource  aggregation  in  the  analysi 


s  of  multi-queues  systems  enjoying 


the  property  of  near-complete  decomposability.  We  are  introducing  two  generaliza¬ 


tions  of  the  model  defined  in  section  2  which  may  be  approached  by  this 
method. 


5,1  In  [Go67]  W.  J.  Gordon  and  G.  F.  Newell  solve  very  elegantly  the  system 
of  equilibrium  equations  (2.3)  by  a  separation  of  variables  technique.  They 
obtain  an  exact  expression  of  the  equilibrium  Joint  probability  P(iQ,...i  ). 
However,  the  form  of  this  expression  is  such  as  to  make  the  determination 
of  the  marginal  probabilities  tedious  for  large  values  of  N  or  L.  So,  for 
the  case  of  systems  with  non-uniform  service -rates,  assuming  that  one  re¬ 
source,  say  RL,has  an  effective  slower  service  rate  than  all  the  other  re¬ 
sources,  they  show  that,  at  the  limit  for  N  -  »,  the  distribution  of  sequen¬ 
tial  processes  within  the  system  is  regulated  by  this  slower  resource;  they 
are  able  to  give  in  this  case  asymptotical  expressions  for  the  marginal 
probability  of  i£  sequential  processes  in  queue  or  in  service  at  a  resource 
Rr  ^0,...,L-1.  This  method  may  in  certain  cases  present  two  types  of  dis¬ 
advantages:  Firstly,  the  values  obtained  are  only  asymptotical  values 
for  N  -»  “,  and  secondly,  the  method  requires  the  inversion  of  the 
(1*1)  *  (L+1)  transfer  probability  matrix  ||  p^J.  Both  disadvantages  may 
be  avoided  if  the  service  rates  and  the  transfer  probabilities  obey  the 
conditions  of  near-complete-decomposability  of  theorem  3.1  so  that  expres¬ 
sions  (4.8)  may  be  used  to  calculate  the  marginal  probability  distributions. 
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—  E,U":l0n  <2a)  be  Saner.llted  so  ..  to  allow,  .s  In  [Ja63],  the 

service  rates  of  any  resource  «,,«■<> . .  to  be  a  function  a  0 

of  the  number  1,  of  sequential  processes  currently  in  or  ln  ,Mvlce 

at  this  resource  (V^: jj. ^(0 )— 0)  . 

U.e  conditions  under  which  this  model  is  nearly-completely-decomposable 
•re  obtained  by  substituting  *<1^  with  ud^  l„  the  inequalities  of 
theorem  3.1.  Simpler  but  stronger  conditions  are  given  by  corollary  3.1: 

For  JML-1 


M><V<m^0Pk"  *  PMbfD)]  >> 


At 

Vl(1i+l>(m^oP(*H)m  -  '(gflKW)’- 


(5.1) 


n»e  interaction  rate  of  the  aggregate  SDl^n^) ,  n^-l,...,**, 
Rjj*  k  >  At  may  then  be  obtained  as 


upon  resource 
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which  extend  expressions  (4.7.2).  Using  (4.8)  one  obtains  the  unconditional 

probabilities  i^O . .  of  1^  sequential  processes  in  service 

or  in  queue  at  any  resource  R^,  jH),..., L. 

Applications  of  this  generalization  are  mentioned  at  the  end  of  [Cg70] 
and  use  is  made  of  it  later  on  in  paragraph  7.9.  The  special  case  of  this 
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generalisation  studied  in  [Go67]  is  the  case  in  which  ^(i^)  is  given  by 


where  a^(i^) 


if  iJt  *  V 

i£  il  *  V 


(5.4) 


at  each  level  of  this  system  there  are  v£  parallel  exponential  servers, 
each  with  mean  service  rate  ^  Sufficient  conditions  for  inequalities 
(5.1)  to  be  satisfied  are  then,  for  *»L- : 


l 

m.ln[M  E  Pkm 

k  m-0 

OSkSjfc 


l 

•  pk(W),]  »  p(W)m  -  <3-« 


Finally,  by  a  very  similar  generalisation,  transfer  probabilities 
pik(if»)  depending  on  the  congestion  at  the  stage  of  departure  may  also  be 
coped  with.  Conditions  (5.1)  as  well  as  relations  (5.2)  and  (5.3)  are  easily 
rewritten  in  this  case.  An  application  of  state-dependent  probabilities 
will  be  considered  in  ^6.1.4. 


5^3  The  hierarchical  model  of  aggregate  resources  may  also  approximate 
multiqueues  closed  systems  in  which,  instead  of  being  exponentially  dis¬ 
tributed,  the  service  times  of  the  upper  level  resources  R..  jHL.L-1 
are  random  variables  identically  and  independently  distributed  with 
arbitrary  distribution  function,  say  B^x).  This  conjecture  is  based  on 
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the  following  argument.  After  time  T^,  when  aggregate  resources  ©^(n^) , 
n.-l,...N,  are  in  equilibrium,  the  time  sequence  of  the  occurrences  of  a 
transfer  of  a  sequential  process  by  ©^n^)  to  a  resource  1^,  k  >  A,  on  a 
request  completion,  may  be  regarded  as  a  stationary  point  process  of 
intensity  ♦Jtfk<njj>,  <cf.  [Kh55J).  If  we  expand  ♦jt  k( n£)  in  terms  of  the 
transfer  rates  X  j“0,...A,  we  have: 

♦,/■*,>  -  *,<*-)'.  V'1  -  Vj<Vjl"i-j)1“t-JP<i-])k 

where  T70<n0ln0)  “°*  AJtU;  tll)  “  1;  and  f°r  J“1 . l* 


ni  VjM 

A.U-j;  n.)  -  E  ...  E  TTjt(n£.ilnJj)***TTjt-j+l(n£-j|nA-j+l 

”t-rl  vf1 


if  (n  )  may  be  considered  therefore  as  the  intensity  of  a  stationary  point 
process  resulting  from  the  superposition  of  (A*l)  independent  stationary 

renewal  processes,  each  of  intensity  [A^U-j;  njt> I jP(A-j) k]’ 
j"0 It  has  been  proved  (cf.  Chapter  V,  in  [Kh55])  that  such  a  super¬ 
posed  process  rapidly  approaches  a  Poisson  Process,  as  the  number  (f+1)  of 
individual  renewal  processes  increases.  This  result  holds  true  whatever 
the  distribution  of  the  time  intervals  between  successive  renewals  in  each 
individual  process  is,  and  thereby,  in  this  particular  caee,  whatever  the 
distribution  functions  Bj(x),  j“0,...,A,  with  respective  means 

^  r.dBj'xx., 


). 
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of  the  service  times  of  resource  Rj  are.  For  jt  large,  one  could  therefore 
envisage  analyzing  each  subsystem  ^(n^) ,  n^-N,... ,1,  as  a  finite  queuing 
system  G|M|l|n^with  an  arbitrary  inter-arrival  time  distribution  B^(x)  and 
a  Poisson  service  process  of  parameter  ^(n^j),  dependent  on  the  con¬ 
gestion  n^ji  • . .  ,n^. 

Under  appropriate  circumstances,  more  adequate  assumptions  than  the 
classical  Poissonian  service  times  assumption  may  therefore  be  made  if 
necessary,  when  an  aggregative  closed  queuing  model  as  defined  in  section  IV 
is  used. 


t 


...  i,  >.v  . 


6.1 


VI.  N EAR -COMPLETE -DECOMPOSAB 11, ITY  IN  COMPUTING  SYSTEMS 


We  deal  in  this  section  with  two  near-complete-decomposable  models  of 
computer  system  operations.  The  first  one  which  is  more  especially  hardware 
oriented,  is  the  model  of  a  computer  memory  organized  as  a  hierarchy  of 
storage  devices  of  increasingly  slower  access  speed  in  which  computations  are 
executed  on  a  multiprogramming  basis;  Near-complete-decomposability  condi¬ 
tions  of  this  model  turn  out  to  conform  with  those  a  computation  must  satisfy 
to  minimize  the  frequencies  at  which  it  needs  to  access  the  slower  levels  of 
the  hierarchy.  Given  a  statistical  definition  of  the  computations,  the  model 
set  up  in  Section  IV  is  then  used  to  define  and  evaluate  performance  criteria 
for  multiprogramming  storage  hierarchies;  an  optimal  degree  of  multipro- 
graiming  minimizing  the  average  hierarchy  access  time  is  evaluated.  We 
finally  show  how  state-dependent  transfer  probabilities  (cfr.  $5.2)  may 
render  the  model  sensitive  to  allocation  policies  which  adjust  the  space 
allotted  to  each  computation  at  every  memory  level  as  a  function  of  the  state 
of  the  multiprogrommed  computations. 

In  the  second  part,  which  is  more  software  oriented,  the  model  of  ag¬ 
gregate  resources  is  interpreted  as  a  simplified  model  of  the  resource 
control  and  allocation  function  of  a  multiprogramming  operating  system. 

We  conjecture  that  conditions  for  resource  aggregation  provide  in  many  cases 
rational  criteria  to  order  the  levels  of  abstraction  [Di68]  [Di69/l]  of  such 
a  system.  These  conditions  turn  out  to  be  closely  related  to  the  physical  char 
acteristics  and  the  usage  of  the  hardware  resources  and  may  therefore  help  the 
designer  choose  the  ordering  of  these  levels. 
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Aggregation  of  variable.  appear.  ln  the..  ^  ^  ..  .  ^  ^ 
•How.  the  level  by  level  evaluation  of  the  .y.t»  Co  be  lnt.rUc.d  ^ 

It.  level  by  uvel  de,l8„.  1M.  re.ult.  frM  t„.  fac£  ^  ^ 

fot  aggregation  are  the  same  ..  tho.e  nece.e.ry  to  provide  sufflcleotly 
approximate  knowledge  of  the  performance,  of  .  etui  incomplete  .y.tem  on 
-.ich  further  de.lgn  declelone  mey  he  ha.ed.  As  .„ch>  the  aggregative 

-del  Of  .  network  of  ,„e„e.  define  l„  section  IV  1.  .  counterpart  of  the 
level-by-level  almul.tlon  technlgue.  remanded  In  [P.67]  [Fa69] 

[Zu68]  [Ra69],  l„  the  ,phere  of  computer  eyetem  enelytlcel  model.. 

6‘1  Storage  Hierarchies 

Sel.l  Computations 

Although  the  conclusion,  of  thl.  .action  apply  to  .  wider  cl... 

°f  hU”rChl . .  “  °f  che  concept  of  linear  .torege  hler- 

•erchy.  defined  and  lnve.tlg.ted  In  fMg70].  to  Introduce  .  .t„oh..tlc  d.fl- 
nltion  of  program  execution. 

Ut  H°,M> . "t  be  *  Une,r  a  Cerege  hierarchy  of  memory  level, 

»t.  A-0....L;  the  higher  the  number  of  th.  level  1.,  the  larger  It.  capa¬ 
city  tend,  to  be  and  the  lower  It.  acce..  .peed.  We  define  the  capacity 

eg  of  M,  a.  the  maximum  number  of  dl.tlnct  Information  element.  M^y 
contain.  * 

A  computron  p  1„  thl.  hierarchy  may  be  Identified  1„  machine- 

Independent  terms  by  a  reference  string  [Oe70]  i  e  a 

y  Lve/vj,  i.e.,  a  sequence  of  refer¬ 
ences  p  ■  (r^..  rP  _p  v  , 

1’  •  y . ^(p)*  where  ie  understood  that.  If  rp  .o/f 

computation  p  reference,  a  (not  „ece...rily  dl.tlnct)  eluent  of  Information 


6.3 


of  name  l-P  at  the  yth  reference.  We  define  T(r^)  as  the  time  Instant  at 
which  computation  p  makes  its  yth  reference;  T(r£+1)  >  T(r£). 

We  assume,  as  is  generally  the  case,  that  a  reference  may  only 
be  serviced  from  the  fastest  level  MQ;  and  we  denote  by  (|Xq)  1  the  expecta¬ 
tion  of  the  amount  of  time  needed  by  a  computation  p  to  complete  a  refer¬ 
ence  to  an  element  r^  which  is  located  in  Mg,  (r£  €  Mg),  at  time  instant 
T(r£).  An  element  r£  located  in  M^Q,  at  the  time  T(r£)  at  which  it  is 
referenced,  must  be  accessed  and  retrieved  in  M^,  and  transferred  from 
M.  to  Mn  where  the  reference  may  be  serviced.  Instead  of  being  transferred 
one  by  one,  elements  are  transferred  by  blocks,  called  pages,  between  levels. 
We  denote  (pt  )-1  the  expectation  of  the  amount  of  time  needed  to  access 
and  retrieve  in  M^0  a  page  containing  a  given  element  r£  €  M^,  and  to 
transfer  this  page  from  to  Mg.  The  major  component  of  (p^)  1  is  the 
time  needed  to  retrieve  and  access  the  page  containing  the  requested 
element.  This  is  a  characteristic  of  each  memory  level.  We  assume  that 
memory  levels  are  identified  by  this  characteristic  and  ordered  in  such  a 
way  that 

(pt^'1  «  (pi2)_1  «...«  (l^)'1.  (6-1) 


A  linear  storage  hierarchy  is  a  hierarchy  in  which  the  only  paths 
to  move  pages  up  the  hierarchy  are  direct  ones  from  each  level  M^  to  level 


M 


JH-I 


,  Jfr=0, . . . ,L-1.  Paths  to  move  pages  down  are  unrestricted.  Let  f^, 


j&*0, . . . ,L,  denote  the  number  of  elements  ry  which,  in  a  given  reference 
string  p  of  length  d(p)  are  located  in  M^  at  time  instant  T(ry).  A  pro¬ 
cedure  which  determines  these  numbers  according  to  page  size,  replacement 
rule,  and  capacity  of  each  level  of  a  linear  demand  paging  hierarchy  may  be 
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found  in  [Mg70];  for  a  large  class  of  replacement  rules  it  is  shown  that 

f/ "*>'  be  °bt*lMd  a»  *  *“?ctlo“  »f  ct  of  the  total  lower 

levels 1  capacity  C^,  -  E  v  f  >  1;  for  jCO,  w.  denote  this  function 
o 

f0^c0^* 

The  relative  aooeee  frequency  to  of  a  given  reference  string 
p  is  then  given  by 

Fi(ci-i>ci>  -  d<p>'1  »  *5<c*-i*'i)  . L> 

“Uh  FS('o>  •  1  •  ^f.1.0,)- 


Letting  n  be  an  arbitrary  number  of  distinct  reference  strings  p, 

P-1 . .  each  wi*h  acceflS  frequencies  F^C^c^),  J&“1 . L,  we  may  de¬ 

fine  an  aoae88  probability  p  (c.  ,,c4)  as 

»  *-l  i 
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pi(ci-i-ci> 


limit  p-1 


f^(Gjt-l»CX) 


1  |  e  •  i  |L 


n  -*  »  n 


E  d(p) 
p*l 


With  p0(c0)  " 


1  - 
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J&-1 


Pje(c/y-i» 


ci)- 


The  set  of  probabilities  is  taken  as  the  defini¬ 
tion  of  the  stochastic  behavior  of  a  computation  in  the  hierarchy  M0 . 

the  remainder  of  this  section  6.1  is  applicable  to  any  memory  hierarchy  pro 
vided  such  a  set  of  probabilities  may  be  defined. 
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6*1.2  Single  Process  Storage  Hierarchy 

Let  us  assume  that  one  computation  at  a  time  is  executed  in  the 

hierarchy.  Then,  we  may  suppose  that  the  time  instant  T(rp  .)  at  which 

y+1 

a  computation  makes  its  (y+l)th  reference  is  also  the  time  instant  at 
which  the  yth  reference  is  completed.  The  mean  time  interval  between  two 
successive  references  of  any  computation  is  in  this  case: 


r;+1)  -  T(rJ>]  .  I£  v  M0  “ '*"■  T(r£> 

+  »  if  ry  €  at  time  -r(rp)  (6. 


An  average  hierarchy  access  time  T  may  then  be  defined 


p0(c0)  X  S>'  +  Pt(C/-l>cl>  x  +  (u,)'1] 


T  -  V  +  £  P^.c,)  X  V1, 


(6.3) 


as  well  as  an  average  fraction  of  time  T)  lost  in  transferring  pages  between 
any  one  level  M^,  i>  1,  and  MQ: 


1  -  S> 


(6.4) 


6.1.3  Multiprocess  Storage  Hiprarchi 


Suppose  now  that  transferring  pages  from  M^0  to  MQ  and  servicing 
references  in  MQ  may  be  performed  simultaneously.  Suppose  also  that  memory 
space  at  the  various  levels  is  equally,  statically  and  permanently  shared 
among  N  independent  computations  p,  p-l,...,N,  executed  in  a  multiprogramming 
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b“U‘  Ihl*  ”“n8  th*C  the  *x*cutlon  of  the  N  reference  .cringe  1.  inter- 
leaved  In  Che  following  way:  Iff  6  at  In. cant  T(rp)  and  iff  a  . 

computation  p'  of  which  rj,'  1.  the  la.t  reference  completed  end  for  which 
r£,+1  €  M0  at  .ome  time  t  a  T(rJ)  before  the  tr.„.fer  of  the  page  contain¬ 
ing  ry  1.  completed,  then  r£+1  1.  ,.r,lced  at  time  t. 

Ihl.  .cheme  provide,  for  a  better  utilisation  of  M„  In  which 
reference,  mey  now  be  .ervlced  while  page,  are  being  t ran. f erred.  A.  a 
consequence  of  thin  however,  the  performance  mea.ure.  defined  by  (6.3) 

«d  (6.4)  are  no  longer  valid:  they  require  Indeed  that  only  „„e  reference 

to  any  one  level  *.» . L,  be  mad.  and  completed  at  a  time,  where.. 

under  multiprogramming  time  Interval,  CtOtJ^)  -  T(rP),  of  dlstlnct  coo|put„ 
tions  p  may  overlap  each  other. 

In  a  multiprogramming  storage  hierarchy,  the  average  time  during 
which  M0  1.  not  being  referenced  within  a  time  Interval  [r(rp+1)  -  T(rp)] 

^  U“  ^  W'  Wh<,n  ry  «  Vo  «  T(rJ).  On  toother  hind, 

«  on.  page  only  „.y  be  tr.n.f.rred  from  a  level  to  at  a  time, 

request,  for  page  tranaf.r.  will  eventually  ,ua».  „p  at  e.ch  level  and  tlma 
Interval.  [T(r^fl)  -  T(rJ)]  will  be  prolonged  by  these  queuing  time.. 

Performance  criteria  corresponding  to  (6.3)  ami  (6.4)  which  allow 

the  benefit  of  multiprogramming  to  be  a..e..«d  will  be  Introduced  In  para- 
graph  6.1.5. 

Near-Complete-Pecomposabllltv 

Let  u.  Identify  each  memory  level  M  with  a  resource  R,  with 
exponentially  distributed  service  time,  of  mean  p'1,  each  of  tha  „ 
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multiprograsnmed  computations  to  a  sequential  process  stochastically  de- 

C.  .  c j  _ 

fined  by  the  same  probability  set  fp,(-'  1  .  -£)  “|L  .  Let  1 

3  N  *  N  mj6-0  LeC  V 

£*1,...,L,  i^*0,...,N,  be  the  number  of  computations  which  at  some  time  t 
are  waiting  for  a  page  to  be  transferred  from  to  M^,  and  let 


i0  "  N  -  Si. 

*1  1 

Defined  in  this  way,  this  model  of  a  multiprogramming  storage 
hierarchy  is  a  closed  multiqueue  system  whose  state  may  be  defined  by  the 
(Lfl)-plet  (1q t • • • »i^)  and  whose  transition  stochastic  matrix  is  Q(N,L). 

Conditions  for  neamcomplete-decomposability  of  this  system  into  subsystem  Q(n  ,*), 
V1 . .  ^L-l,...,!,  reduce  to  (see  remark  2  ,  end  of  section  III): 


for  A“L-1, . . . ,1:  min 
(KkSjJ 


and  4  -  . 


(6.5) 


sine,  transfer  probabilities  {p^}^  are  all  equal  to  unity. 

Conditions  (6.5)  are  expressed  In  terms  of  rates  at  which  a  page 
may  be  retrieved,  accessed  and  transferred  from  each  level,  and  in  terms 
of  rates  at  which  levels  are  referenced  by  an  individual  computation.  More 
precisely,  the  set  of  probabilities  (p/-^-  ,^)  1^.#  define,  the  stochastic 
behavior  of  a  computation  in  a  single  process  storage  hierarchy  of  capacities 
t“lW  These  probabilities  follow  from  the  statistical  properties  of  the 
reference  strings,  but  also  from  the  choice  of  storage  management  parameters 
such  as  the  memory  level  capacities,  the  page  size,  the  replacement  rule. 

The  near-complete  decomposability  of  the  multiprogramming  storage  hierarchy 
model  may  be  assessed  by  using  e.g. ,  a  technique  as  the  one  described  in 


6.8 


[Mg70],  provided  that  these  parameters  be  the  same  for  each  multiprogrammed 
computation. 

One  may  expect  inequalities  (6.5)  to  hold  in  most  storage  hier¬ 
archies  for  various  reasons.  First,  the  lower  bound  for  the  average  time 
needed  to  complete  a  single  reference  in  a  N-process  hierarchy,  which  cor¬ 
responds  to  the  case  of  no  queuing  of  page  transfer  requests,  is  equal  to 


-1  ^  ci  -1 

V  +  s  '-ip'-?  *  <**>  • 

W  tr>\ 


(6.6) 


Minimizing  (6.6)  is  mandatory  to  optimize  the  average  execution  time  of  a 
computation.  Both  the  user  and  the  system  storage  management  policies  will 
aim  at  this  insofar  as  they  have  control  over  the  placing  of  information 
elements  in  the  various  levels,  i.e.,  over  the  values  of  the  access  prob¬ 
abilities.  But,  as  a  result  of  inequalities  (6.1),  the  only  possible  way 
to  minimize  (6.6),  is  to  achieve  the  set  of  inequalities 

°"  c’  CL-1  CA 


»„<-5>  »  n<-5  .-S>  »•  •  •»  ■  <*•» 


which,  together  with  (6.1),  are  (see  remark  (2),  end  of  section  III)  suf¬ 
ficient  conditions  for  (6.5)  to  hold. 

Secondly,  it  has  been  observed  that  computations  enjoy  what  has 
been  called  the  property  of  locality  [Be66,  Va67,  Be68,  De68/l,  De70];  that 
is  they  favor  at  each  instant  of  their  execution  a  subset  of  their  informa¬ 
tion  instead  of  scattering  their  references  uniformly  over  their  total  set 
of  information,  this  subset  changing  membership  relatively  slowly  in  the 
course  of  the  computation  execution. 
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As  a  consequence  of  this,  every  page  transfer  which  is  occasioned  by  a 
single  reference  to  one  upper  level  is  likely  to  generate  several  refer¬ 
ences  to  Mq.  On  the  other  hand,  page  replacement  policies  like  LRU  (see 
e.g.,  [De68/2])  take  advantage  of  this  property  to  accentuate  inequalities 
(6.7).  Inmost  cases,  therefore,  conditions  for  L-level  near-complete- 
decomposability  will  be  satisfied. 

Remark .  It  is  probably  worth  saying  a  word  about  what  happens  when  in¬ 
equalities  (6.1)  only  are  verified  and  not  inequalities  (6.6).  One  could 

show  that  in  this  case  stochastic  matrices  Q(n^,  J&),  j^l . L,  are  (see  remark  1.6) 

near -decomposable  block-triangular  matrices  in  the  sense  of  Ando  and  Fisher 
[An63  ] .  They  proved  that  the  Simon -Ando  theorem  could  be  modified  to  the 
case  of  such  matrices.  A  resource  aggregate  model  analogous  to  the  model 
defined  in  Section  IV  could  therefore  be  set  up  to  cope  with  such  systems. 

6.1.5  Memory  Level  Aggregation 

If  conditions  (6.5)  are  verified,  the  aggregative  model 
defined  in  Section  IV  may  be  used.  We  proceed  now  by  demonstrating  how 
performance  criteria  comparable  to  (6.3)  and  (6.4)  may  be  defined  within 
that  model. 

We  regard  the  aggregate  fflyn^) ,  trl . L,  as  a  stor¬ 
age  hierarchy  of  levels  Mfl . M «,  in  which  n.  computations  defined  by  a 

i»k-i  c.  ,  1 

same  probability  set  {pfc( — 1^-0  are  being  nmltiprogranraed.  cr^n^) 
is  defined  as  the  mean  number  of  references  serviced  in  Mq  per  time  unit 
within  this  aggregate,  and  is  given  by  (4.1)  ,  (4.2)  which  reduce  to 
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'7«<”*>  "n  r=l  Vni-ll'\t>'W,t-l) 

“e-i  L 


(6.8) 


CTl(nl)  *  M*o(1-TT1(°|ni>)  ,  j6-2,...,l 


(6.9) 


same  way,  ♦^(n^)  defined  as  the  rate  at  which  aggregate  2Jl^(n^) 
references  memory  level  k  >  1.  is  given  by  relations  (4.3)  and  (4.4)  in  whil 

Pjflc  =  °»  ^*1* • • • »L  ,  k  >  l, 

C.  .  c. 

P  =  /  k-1  k. 

Ok  Pk(“lT,~)* 


Condition.!  probabiUtles  „,)  are  yielded  by  (4.7.2)  in  which 

P£Q  =  ^"lj.o.jL 

P£i  *  0,  o  <  i  i  1-1. 


An  averse  hierarchy  accas.  tine  f  Is  then  obtained  a.  the  in. 


verse  of  the  average  number  of  references  serviced  per  time  unit  In  j.  (K): 

,_1  *  ,-l 


t<TL(N)]' 


C.  E.iaL(Vl)aL-l<nL-l)] 

\-l 


(6.10) 


the  long-run  equilibrium  probability  a^),  n^-  0 . .  being  given  by 

(4.8). 

Moreover,  an  average  fraction  of  time  during  which  all  N  computa¬ 
tions  are  waiting  for  a  page  transfer  from  some  memory  level  M£,  l  >  1, 
to  Mq  is  given  by 


,w  -  v°> 
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It  may  be  proved  that  and  are  bound  by  the  same  relation 
(6.4)  as  1]  and  T: 

Lemma  6.1  \  "  1  -  [p^XT^-1. 

N 

Proof:  s0(0)  a  a  (0)  -  l  .  E  a^), 

V1 


Tj  and  T  being  defined  by  (6.4)  and  (6.3)  re  pectively,  the  follow 
ing  relations  may  also  be  established: 

Lerrna  6.2  ^  ■  T;  7^  -  7)f 

Proof :  From  the  definition  (6.10)  of  T^,  it  results  that  for  t^l: 
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Ti  -  [aL(l)  CT^d)]'1  -  [nL(l|l)  aL_1(l)]"1.  (6.11) 


-1 


In  the  queuing  system  5S^(1),  j8f2 . L,  TT^(l|l)  may  be  ex¬ 

pressed  as  (see  (4.7.2)): 


h-i  i(1) 

V1  *>  ■  -1  :i - 


(6.12) 


In  this  particular  case  where  transfer  probabilities  p^,  k  >  j&, 
l>  0  are  all  zero,  (4.3)  and  (4.4)  yields,  using  (6.8)  and  (6.9): 

*j&-l,jt  (1)  "  tpX  (Cj&- 1*°^  X  al-l  t>a2»  (6.13) 

Replacing  tt^ ( 1 1 1 )  in  (6.11)  by  the  value  yielded  by  (6.12)  and  then 
^L-l  L  ^  by  the  value  yielded  by  (6.13),  gives: 

^1  =  PL  ^CL-1,CL^  ^i/  +  aL-l 

Using  the  relation  cj^d)  ■  TT^(l|l)  c^d)*  successively  for  jJfL-1,L-2,.  .  .2, 
gives: 

-  2  -1-1 
Ti  "  Vl  <CA-l,cl>  +  pi  (1)> 

But  CT11(1)  -  [|±0  TTj  (1 1  1 )  ]  _1 


where,  by  virtue  of  (4.5)  and  (4.6),  tt”1  (1 1 1)  may  be  replaced  by 

_-i„i,x  _  C|J,o  pi(co,ci)]  1  +  ^i1 

TTj  (1 J 1)  - - ^ -  ; 

Cm-o  Pi<co»c1>J 


iu.  *!***, 
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which  gives  evidence  that  -  T.  Then  \  -  Tj  results  directly  from 
Lemma  6.1,  which  completes  the  proof. 


Furthermore,  let  us  remark  that  the  utilization  factor  of  the 
storage  hierarchy  component  which  performs  transfers  from  to  MQ,  l  >  1, 
is  equal  to  (l-s^O)),  probability  8^(0)  being  defined  by  (4.8).  Like¬ 
wise  (4.9)  defines  the  mean  response  time  of  this  component. 

Finally,  if  the  uppermost  level  is  the  model  of  some  infinite 
reservoir  from  which  all  computations  originate  at  a  rate  given  by  (2.01) 
and  to  which  they  all  return  when  being  completed,  then  W,  the  mean 
response  time  of  the  aggregate  of  Mq»mi»  •••  defined  by  (4.10),  is 

the  mean  time  required  to  service  all  references  of  a  reference  string. 

6.1.6  Dynamic  Space  Sharing 

By  virtue  of  leranas  6.1  and  6.2,  the  performance  improvement 
achieved  by  multiprogramming  N  computations  so  as  to  take  advantage  of  the 
simultaneity  between  transfers  among  memory  levels  and  servicing  references 
in  Mq  may  be  measured  by  the  ratios: 

h  or  %  <6-14> 

In  general,  there  will  exist  one  optimal  value  NQpt  which  max¬ 
imizes  these  performance  ratios.  This  optimum  results  from  two  counter¬ 
acting  effects,  on  the  one  hand  the  probability  (l-a^O))  of  having  at 
least  one  computation  not  waiting  for  a  page  transfer  from  some  level 
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M.,  i  >  1,  to  M„  Is  inclined  to  increase  with  N;  on  the  other  hand  the 
l*  ’  0 

c  a 

average  memory  space  J&»0,1,...  available  to  each  computation  at  the 

lower  levels  shrinks  as  N  increases  with  the  consequence  that  the  prob- 
C «_i»c  « 

abilities  p  —  zr)  of  accessing  the  upper  levels  increase  rapidly. 

I  N  N 

The  probabilistic  model  described  above  provides  a  means  to  evaluate  this 
optimum,  given  (i)  a  family  of  programs  defined  by  their  reference  strings 
from  which  access  probabilities  may  be  inferred  by  e.g.,  a  stack  processing 
technique  like  that  proposed  in  [Mg70],  and  (ii)  a  nearly-completely-decomposable 
memory  hierarchy  of  an  arbitrary  depth,  each  memory  level  being  character¬ 
ized  by  its  capacity  and  a  distribution  of  the  amount  of  time  required  to 
access,  retrieve  and  transfer  a  page  from  this  level  to  the  executive  memory. 

So  far  we  have  assumed  that  a  fixed  and  equal  portion  — ,  j6=0,...L 
of  memory  space  was  permanently  allocated  at  every  level  to  every  computa¬ 
tion  in  the  hierarchy.  This  is,  howevur,  not  the  case  in  most  hierarchical 
storage  systems.  Usually,  in  order  to  keep  moderate  the  access  probabilities 
to  the  upper  levels,  an  upper  limit,  say  Jmax»  Is  imposed  upon  the  number 
of  computations  J  allowed  simultaneously  to  share  space  in  the  faster  memory 
levels,  say  MQ . Mjj(max)*  Moreover*  any  of  these  computations  Is  expro¬ 

priated  from  the  space  it  has  accumulated  in  these  levels  as  soon  as  it 
makes  a  reference  to  or  above  a  certain  level,  say  M^,  k  >  A(max) .  Thus,  if 
n^  j  designates  the  number  of  computations  not  waiting  for  a  page  transfer 
from  levels  we  have 

J  -  mlnCn^j,  JMX)  i 


and  the  average  space  allotted  to  any  of  these  computations  at  any  time 
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Instant  T(rP),(y-1.2,...),(p-l . J>*  «  *”?  Uvel  Mf  1  *  lv"ax)>  U  the 

y  _1 
integer  part  of  (c^  X  J  )• 

This  space  allocation  policy  preserves  the  near -complete-decomposab ill ty 
property  of  the  hierarchy  since  its  net  effect  is  to  enhance  the  differ¬ 
ence  between  access  probabilities  to  the  lower  levels  MQ . MA(max)» 

and  the  upper  levels  . V  ^ax  distinct  access  probability  dis¬ 

tributions,  conditioned  on  the  value  of  n^,  must  be  considered: 


r  Z;1  ci c A,  .u,  L 

P,<  s  t^1 . L 


DC^): 


1  1-0  aial 


L  Jt-1  c,  c 

|P0  ■  1  “  E  P«(  E  ““»“>» 

0  jfr-1  1  i-0  ®i  ai 


(6.15) 


fmin(nk.1»  Jmax)»  tm°9 •  •  •  ’ A(max)  ' 


where  a. 


IN, 


i*»  l(roax)+l , .  •  •  4*  • 


There  are  three  stages  in  the  analysis  of  such  a  hierarchy: 

(i)  The  bottom  levels  MQ . are  ®od«llcd  by  Jmax  a86re«ate8 

3jl  ^(j),  j-1 . Jmax,  each  one  with  the  appropriate  access 

probability  distribution  D(J).  The  analysis  of  these  aggregates 

yields  the  interactive  rates  ♦jR(max)fi<J>  of  the  bottom  levels 

upon  each  level  i  a  ^max"^-**  *  * 

(ii)  Next,  one  considers  N  aggregates  Vl"1 . N*  1,16 

interaction  rate  of  the  bottom  levels  upon  any  one  intermediary 

level  M  „  i'=/  +1 . k-1,  in  each  of  these  aggregates  is  taken 

i*  max 

equal  to 
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^(max),i'(nk-l)  if  Vl  <  Jmax 

'l'£(max),l'(Jmax)  if  Vl  *  Jmax* 


(111)  The  analysis  of  each  aggregate  yields  the  Inter¬ 
active  rates  i|^_^  uPon  the  upper  levels!", 

1* '“k,...  ,L;  using  these  Interaction  rates  in  (4.7.2)  and 
(4.5),  the  upper  levels  may  be  modelled  by  a  system  Bl^N) . 


The  same  argument  which  leads  to  an  optimal  value  for  N,  Is 
applicable  to  J  .  The  model  outlined  above  may  be  used  to  estimate  the 

lUaX 

value  of  an  optimal  maximum  degree  of  multiprogramming  In  the  lowest 

memory  levels,  which  for  a  given  N  value  maximizes  the  ratios  (6.14).  Such 
an  optimal  value  J°P*  will  be  calculated  In  the  last  section  for  a  hypo¬ 
thetical  computer  system  enjoying  the  space  allocation  policy  discussed 
above. 


6.1.7  Remark 

A  variant  of  the  model  investigated  here  may  be  found  in  [Cou70/l], 
[Cou70/2].  There,  a  computation  is  stochastically  defined  by  a  set  of 
arbitrary  distribution  functions 


B^x)  -  Probfc^  £  x],  jSS=1,  . . .  ,L 


where  the  random  variable  Q ^  is  the  number  j  of  references  made  by  a 
computation  p  within  the  time  interval  (T(rP  )  -  T(rp)l  if  T(rp)  and 

y+J  y  y 

T(rP. .)  are  two  successive  instants  of  an  access  to  memory  level  M  . 

y+j  f 

If  n  is  an  arbitrary  number  of  computations,  the  expectation  £.  and  the 

At 


access  probability  PjCC^.c^)  are  related  by 


«'i '  xdBi(*> 


f, 


E  d(p) 
p-1 


[pi(Ci-l*ci>]  • 


When  the  system  is  nearly  completely  decomposable,  the  conditional 

probability  distribution  ^(n^n,),  *1 . .  n^l . N,  is  obtained  as 

the  distribution  of  the  congestion  in  a  m|g|i|n  queuing  process  whose  dis¬ 
tinctive  feature  is  that  the  service  rate  a^Oi^) ,  n^-l,...^  (defined 
by  (6.8),  (6.9))  at  which  references  are  serviced  is  dependent  upon  the  cur¬ 
rent  congestion  n^.  This  type  of  m|g|i|n  queuing  process  is  studied  in 
[C6u71]. 
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6.2  Hierarchical  Structure  of  Multiprogramming  Computer  Operating  Systems 
The  technique  of  aggregation  proved  useful  to  (i)  break  a  system  up 
into  a  small  number  of  subsystems,  (ii)  evaluate  the  interactions  within  the 
subsystems  as  though  interactions  among  subsystems  did  not  exist,  and  (iii)  eval 
uate  the  interactions  among  subsystems  without  regard  to  the  interactions  with¬ 
in  sybsys terns .  We  may  expect  that  this  evaluation  technique  has  some  simi¬ 
larity  with  the  design  technique  which  consists  of  assembling  a  complex  system 
from  subsystems  designed  independently;  we  may  expect  that  the  criteria  that 
indicate  what  variables  to  aggregate  might  also  help  in  specifying  what  the 
building  blocks  should  be. 

In  particular,  it  seems  worth  discussing  the  similarity  existing  between 
the  hierarchical  model  of  aggregate  resources  defined  in  Section  IV  and  the 
well  known  hierarchical  organization  advocated  by  E.  W.  Dijkstra  for  the 
software  of  multiprogramming  computer  systems  [D169/1], 

6.2.1  Levels  of  Abstraction 

An  essential  function  of  a  multiprogramming  computer  operating 
system  is  to  control  and  allocate  hardware  resources.  The  problem  of  creat¬ 
ing  a  system  which  achieves  this  function  could  be  formulated  as  follows: 

Given  the  hardware  of  a  computer,  construct  a  set  of  programs  whose  function 
is  to  provide  a  collection  of  convenient  and  efficient  abstractions  from  some 
physical  characteristics  (quantity,  speed,  access  mechanism,...)  of  these 
resources;  two  constraints  which  further  complicate  the  problem  are  that: 

(i)  to  achieve  these  abstractions,  the  programs  themselves 
use  or  consume  hardware  resources; 


IMMP 


6.19 


(11)  if  these  abstractions  are  to  serve  any  useful  purpose, 
their  designer,  because  of  his  own  "inability  to  do 
much"  [Di69/2],  should  be  their  first  beneficiary; 
rather  than  using  and  being  defined  in  terms  of  raw 
hardware  resources,  the  programs  should  insofar  as 
possible  be  defined  in  terms  of  and  use  these  abstractions. 

E.  W.  Dijkstra  showed  [Di68],  [Di69/l]  that,  from  a  designer  point 
of  view,  it  is  advantageous  to  structure  such  a  system  as  a  hierarchy  of 
levels  of  dbetraotion .  He  considers  an  ordered  sequence  of  machines 
^O'^l*  *  *  *  'AjJ*  *  *  *  w*iere  the  level  is  the  given  hardware  and  where  the  soft¬ 

ware  of  level  l,  (f 0,1,...,  defined  in  terms  of  and  executed  by  machine  A., 

tr 

transforms  machine  into  machine  A^.  The  software  of  each  level  t  creates 
an  abstraction  from  sane  physical  properties  of  the  hardware.  Examples  of 
such  abstractions  implemented  in  the  THE  system  are: 

-  At  level  0,  the  central  processor  is  allocated  among  concurrent 
processes  so  that  the  actual  number  of  these  processor(s)  (one  in 
this  system)  is  no  longer  relevant:  each  process  ready  to  use  a 
processor  may  be  considered  above  this  level  as  having  access  to  its 
own  virtual  processor; 

-  at  level  1,  the  differences  in  mode  and  speed  of  access  between 
a  drum  and  a  core  memory  are  abstracted  from  in  order  to  create 
an  homogeneous  store.  Above  this  level  the  physical  location  of 
information  in  this  store  is  no  longer  relevant;  information  is 
identified  by  a  segment  name. 
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Each  level  gives  access  to  a  certain  type  of  abstracted  resource, 
e.g.,  a  virtual  Processor,  a  segmented  address  space,...  To  do  so,  it  has 
recourse  only  to  the  abstractions  created  at  the  lower  levels.  This  approach 
facilitates  the  step  by  step  construction,  testing  and  evaluation  of  the 
system.  These  advantages  are  the  direct  consequence  of  the  restricted 
nature  of  the  interactions  which  are  permitted  among  distinct  levels.  This 
very  same  restriction,  however,  leads  to  the  following  dilemma:  if  we  use 
the  simplification  introduced  by  abstracting  from  device  in  producing 
the  abstraction  from  device  R^,  then  we  cannot  use  the  simplification  intro¬ 
duced  by  abstracting  from  R^  in  writing  the  programs  that  abstract  from  R^; 
in  other  words,  each  level  helps  only  those  above  it.  The  choice  of  the 
ordering  of  the  abstractions  may  therefore  be  quite  difficult. 


6.2.2  Aggregation  and  Ordering  of  Abstractions 

It  is  possible  to  identify  at  least  two  types  of  conditions  which 
must  be  satisfied  to  decide  that  the  abstraction  from  a  device  R^  should  be 
created  at  a  lower  level  than  the  level  of  abstraction  of  a  device  Rj  of 
another  type: 


1)  The  abstraction  from  the  details  of  device  R^  should  be  more 
convenient  to  program  the  abstraction  from  device  Rj  than  con¬ 
versely;  if  the  abstraction  from  R^  is  not  more  convenient,  it 
need  not  come  below  the  level  of  abstraction  from  Rj. 


2)  The  use  of  an  abstraction  from  R^  to  program  the  abstraction  from 
Rj  should  not  prevent  the  ] 
and  allocated  efficiently, 


Rj  should  not  prevent  the  latter  resource  from  being  controlled 
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We  may  (and  in  fact  we  shoild)  expect  that  in  most  cases  the  degree 
to  which  an  abstracted  resource  is  a  'convenient*  tool  or  concept  to  program 
will  be  reflected  in  the  execution  of  the  program  by  the  frequency  at  which 
this  abstracted  resource  is  accessed.  Then,  the  result  of  applying  the  first 
condition  to  each  type  of  abstraction  is  that  the  abstracted  resources  imple¬ 
mented  at  the  lower  levels  of  abstraction  are  those  which  will  be  the  more 
frequently  accessed. 

The  second  condition  must  be  interpreted  as  follows.  Drums,  disks, 
tapes,  readers,...  are  mutually  asynchronous  i/o  devices,  each  one  being 
capable  of  transferring  information  at  a  certain  speed.  In  order  to  control 
and  allocate  efficiently  one  of  these  devices,  a  program  must  be  able  to 
execute  at  a  speed  comparable  with  the  speed  of  the  device;  in  other  words, 
the  speeds  at  which  the  resources  (abstracted  or  not)  used  by  such  a  program 
can  be  accessed,  e.g.  those  of  an  abstract  machine  A^,  should  not  be  slower 
than  the  speed  of  the  hardware  being  abstracted  from  at  level  l\  or,  at  least, 
the  resources  which  make  exception  to  that  rule  should  be  used  infrequently. 

The  application  of  conditions  1)  and  2)  to  each  type  of  abstraction 
leads  to  an  ordering  such  that  the  more  frequently  accessed  abstracted. re¬ 
sources  should  be  implemented  at  the  lower  levels  and  should  also  have  faster 
access.  The  interactions  between  the  levels  of  an  'efficient'  and  'convenient' 
hierarchy  of  abstractions  should,  in  other  words,  obey  the  conditions  for 
near-complete  decomposability. 

Since  the  speed  of  access  to  an  abstracted  resource  is  bound  by  the 
speed  of  the  hardware  which  is  abstracted  from,  conditions  for  near-complete- 
decomposability  explain  why  the  fastest  and  more  frequently  used  hardware 
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is  taken  care  of  at  the  lowest  levels  of  a  hierarchy  of  abstractions.  These 
conditions  specify  in  terms  of  usage  and  speed  the  type  of  hardware  which 
should  advantageously  be  abstracted  from  at  a  certain  level  of  abstraction 
and  may  thereby  give  the  designer  some  guidance  in  the  choice  of  the  order¬ 
ing  of  these  levels. 

Another  consequence  is  that  each  level  of  abstraction  may  be 
analyzed  as  a  level  of  aggregation.  Suppose  for  example  that  the  model  of 
the  network  of  queues  defined  in  Chapter  II  may  be  taken  as  a  simplified 
model  of  a  set  of  interacting  abstracted  resources;  the  abstract  machines 
A  j_» .  • .  »A  .  could  then  evaluated  respectively  by  the  aggregates 

^(nj),  (n2) , . . .  *®jj(n^) . each  aggregate  ffll^n^)  being  a  modi  .  of  the 

equilibrium  attained  in  the  abstract  machine  A.^  by  the  interactions  be¬ 
tween  A^  and  the  abstracted  resource  implemented  at  level  L  By  definition, 
the  dynamic  behavior  of  the  aggregate  IRj(n^)  towards  its  equilibrium  may 
be  evaluated  with  good  approximation  merely  in  terms  of  aggregative  vari¬ 
ables  representative  of  the  short-term  equilibria  attained  at  the  lower 
levels  of  aggregation  and  without  regard  to  the  interactions  with  the  upper 
levels.  Since  A^A^...  represent  as  many  distinct  stages  in  the  design, 
production  and  testing  of  an  operating  system,  such  an  evaluation  technique 
which  allows  these  stages  to  be  evaluated  independently  should  prove  more 
valuable  than  techniques  which  may  only  be  used  after  the  design  is  completed. 

To  close  this  section,  let  us  say  that  the  above  discussion  reveals 
a  property  of  these  stages  A^fA^,...  of  the  design  process:  the  rate  of 
interaction  between  the  components  of  a  machine  A  ,  i.e.,  between  A  and 
the  abstraction  implemented  at  level  l- 1,  is  higher  than  the  rate  of  inter¬ 
action  between  the  corresponding  components  of  A^.  Several  arguments  in 
favor  of  a  decomposition  of  the  design  process  into  successive  stages  at 
which  the  subsystems  coped  with  have  a  similar  property,  may  be  found  in  C. 
Alexander's  essay,  "Notes  on  the  Synthesis  of  Form"  [Alb4]. 
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VII.  SHORT-  AND  LONG-RUN  EQUILIBRIA 
IN  A  TIME-SHARING  PAGING  MULTIPROGRAMMING  SYSTEM 


The  purpose  of  this  last  section  is  to  illustrate  the  use  of  an  ag¬ 
gregative  model  in  analyzing  the  performances  of  a  given  computing  system. 

It  follows  on  from  this  concrete  case  study  that  aggregation  is  not  only 
adequate  to  obtain  numerical  results  when  a  large  number  of  parameters 
are  involved,  but  also  helps  to  gain  insight  and  conceptual  clarity  on 
the  parts  played  by  these  parameters. 

A  set  of  hardware  and  software  dependent  parameters  is  defined  as 
the  model  of  an  hypothetical  time -sharing  paging  system.  Conditions  for 
this  model  to  be  nearly  completely  decomposable  are  defined  in  terms  of  the  para 
meters  and  in  terms  of  a  stochastic  representation  of  the  computing  load.  The 
formulation  of  these  conditions  makes  their  prior  verification  possible 
and  may  give  an  assessment  of  the  precision  which  may  be  achieved  by  using 
an  aggregative  model. 

Page  traffic  between  primary  and  secondary  memory  is  studied  as  the 
internal  traffic  of  a  resource  aggregate  of  type  ^(n^).  This  analysis 
focuses  on  the  respective  influence  on  processor  usage  of  the  user  program 
paging  activity,  the  degree  of  multiprogramming  and  the  length  of  a  so- 
called  execution  interval,  viz.  the  amount  of  processor  time  a  user  pro¬ 
gram  is  allowed  to  consume  before  losing  its  pages  accumulated  in  primary 

memory. 

The  entire  system  is  then  regarded  at  a  higher  level  of  analysis  as 
a  finite  set  of  N  active  user  terminals  supplying  tasks  for  this  aggregate. 
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The  congestion  and  the  response  time  of  the  hypothetical  system  are  studied 
as  the  characteristics  of  a  system  (N) .  The  concept  of  stable  and  un¬ 
stable  congestions  is  introduced  as  the  consequence  of  the  peculiar  de¬ 
pendency  of  the  aggregate  service  rate  on  the  system  congestion.  Likewise, 
a  saturation  point  extending  the  definition  given  in[K168]  to  systems  with 
congestion-dependent  service  rate  is  defined.  Finally,  the  evaluation  of 
the  system  response  time  reveals  that  these  concepts  of  stability  and  satura¬ 
tion  relate  the  phenomenon  known  as  thiashing  to  parameters  defining  the  computing 
load  upon  the  system  in  addition  to  those  defining  the  page  traffic  between 
primary  and  secondary  memory. 

7.1  The  Hypothetical  System 

A  schematic  representation  of  this  hypothetical  system  is  given  in 
figure  7.1.  Three  types  of  function  are  essentially  ensured: 

7.1.1.  A  finite  number  N  of  active  user  terminals  originate  random 
requests  for  program  execution. 

7.1.2.  User  programs  are  executed  on  a  multiprograimned  basis  in  a 
primary  memory  MQ  consisting  of  set  of  CQ  pa*;o  frames. 

7.1.3.  Pages  which  cannot  be  contained  in  Mg  are  swapped  in  a  rotation¬ 
al  secondary  memory  M,  (T  «  duration  of  a  rotation) . 

I  rot 

These  functions  are  supposed  to  comply  with  the  following  strategies: 

7.1.4.  A  terminal  cannot  originate  a  request  for  program  execution 
before  the  previous  request  issued  from  the  same  terminal  has  been  served 
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and  completed.  In  other  words,  there  may  exist  a  maximum  of  one  program 
per  terminal  in  the  system  at  a  time. 

7.1.5.  User  programs  are  loaded  from  M^  where  all  pages  are  supposed 
to  be  Initially  located  into  MQ  on  a  page  on  demand,  strategy.  An  upper 
limit  K  is  imposed  upon  the  number  of  page  frames  a  user  program  may  occupy 
in  Mq.  Pages  of  a  same  user  program  superimpose  each  other  whenever  the 
number  of  distinct  pager  required  in  '  by  this  program  exceeds  thi i  upper 
limit. 


7.1.6.  Multiprogramming:  At  any  moment  of  its  lifetime  in  Mq,  a  user 
'program  is  in  one  of  three  states: 

ready  i.e. ,  demanding  but  not  receiving  the  control  of  the 

processor 

running  receiving  the  control  of  the  processor 

suspended  waiting  for  a  page  transfer  between  M^  and  Mq  to  be 
completed 

Multiprogramming  means  that  user  programs  are  concurrently  executed 
in  Mq  in  order  to  maintain  the  processor  busy  as  long  as  not  every  user 
program  is  suspended.  We  assume  that  a  maximum  number  J  x,  0  <  Jmax  s  N, 
of  user  programs  may  at  moat  be  concurrently  executed  in  Mq.  Further  re¬ 
quests  for  program  executions  are  queued  until  one  of  the  J  x  programs 
ceases  to  be  multiprogrammed  (see  7.1.7).  J  will  be  re/erred  to  as  the 

lu&X 

maximum  degree  of  multiprogramming . 
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7.1.7.  Time-slicing  •.  A  program  ceases  to  be  multiprogrammed  either 
when  it  is  completed  or  when  the  total  time  it  has  spent  in  running  state 
since  its  last  loading  in  Mfl  reaches  a  maximum  value  Q.  In  thin  latter 
eventuality  and  if  programs  are  waiting  to  be  multiprogrammed,  the  pro¬ 
gram  will  lose  all  its  pages  accumulated  in  Mfl,  and  will  join  the  queue  of 
programs  waiting  to  be  multiprogrammed  until  it  is  allocated  an  additional 
quantum  Q.  If  no  programs  are  waiting  to  ba  multiprogrammed,  the  program 
keeps  its  pages  in  Mq. 

7.18.  The  tracks  in  Mx  may  be  written  and  read  in  parallel  by  fixed 
heads.  Each  track  is  divided  into  Cj  sectors,  each  sector  containing 
exactly  one  page.  Each  request  for  a  page  transfer  is  put  into  a  queue 
associated  with  the  sector  containing  the  demanded  page  [Wei66]. 

7.2  The  User  Programs 

The  user  programs  are  stochastically  defined  by  the  followin'*  random 
variables: 

(i)  the  user  reaction  time,  i.e.,  the  time  interval  elapsing  between 
the  completion  of  a  user  program  execution  requested  by  a 
terminal  and  the  origination  of  the  nsxt  request  by  the  same 
terminal 

(ii)  the  total  amount  of  processor  time  consumed  by  a  user  program 

(iii) the  amount  of  processor  time  5n,  n«*l,2,...,  consumed  by  a 
user  program  between  any  two  successive  references  rn»*n+1 

to  distinct  pages  not  already  referred  to,  rn+1^rn^rn-i^* • ^rl» 


(user  program  paging  activity). 
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The  distributions  of  these  random  variables  are  defined  in  the  next 
section. 

7.3  Simplifying  Assumptions 

The  hypothetical  system  takes  liberties  with  reality  on  a  certain 
number  of  points: 

7.3.1.  Pages  are  assumed  to  be  uniformly  distributed  over  the 
sectors  of  the  probability  of  a  demanded  page  being  located  in  a 
specific  sector  is  thereby  equal  to  (£  ). 

7.3.2.  We  suppose  that  the  traffic  of  demanded  pages  from  ^  to  MQ 

is  not  hindered  by  the  opposite  traffic  of  pages  from  MQ  to  This 

assumption  is  valid  if  a  page  frame  is  permanently  maintained  vacant  in 

M  as  well  as  in  each  sector  of  M.  [Sm67].  Hence  we  need  only  to  take 
0 

into  account  the  traffic  of  pages  from  to  MQ. 

7.3.3.  The  maximum  number  of  distinct  pages  a  program  may  accumulate 

in  Mq  at  some  ti.de  t  is  a  function  of  the  number  J,1«JS  °f  programs 

being  multiprogrammed  at  that  time  t.  This  function  is  supposed  to  be 

simply 

K(J)  -  entier  (—). 

7.3.4.  We  define  an  execution  interval  as  the  amount  of  processor 
time  consumed  (i.e.,  the  amount  of  time  spent  in  running  state)  by  a  user 
program  between  two  consecutive  loading  of  this  user  program  from  Mj  into 
Mq.  An  execution  interval  is  at  most  equal  to  Q. 
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The  total  amount  of  processor  time  consumed  by  a  user  program  Is 
considered  as  an  Independently,  Identically,  negative  exponentially  dis¬ 
tributed  random  variable  with  mean  (cp)"*.  As  a  result  of  this  assumption, 
an  execution  Interval  Is  an  l.i.d.r.v.  with  distribution  function: 


Q(t) 


Prob(x  £  t) 


(t  *  0) 

<0  <  t  <  Q) 
(Q  *  t) 


Denoting  n(S)  the  Laplace -Stieltj is  transforms  of  Q(t),  one  obtains: 


h(s)  -  r  e"St  d  Q(t>  (Re(s)  >  0) 

'0 

a  PQ  e"«t  te-cptdt  +  e-sQ-cpQ 

Jo 

-  <P  +  se-8<W 
s+cp 


Let  (n) 


denote  the  mean  length  of  an  execution  interval: 


<H)-1  -  h'.(0)  -  (l-e^Jcp'1. 


Q(t)  will  be  approximated  by  an  exponential  distribution  of  parameter  p.. 

7.3.5.  User  reaction  times  are  considered  as  i.i.d.r.  exponentially  dis¬ 
tributed  with  parameter  X;  a  discussion  of  the  validity  of  this  assumption 
may  be  found  in  [Cof66]. 


7.3.6.  Random  variables  ?n,  n=l,2,...,  are  considered  as  i.i.d.r. 
exponentially  distributed  with  respective  parameters  9n,  nBl,2,...  . 
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7.4  Actualizing  the  Hypothetical  System 

The  Hypothetical  System  is  now  defined  in  terms  of  the  parameters 

CA,  c,  ,  T  ,  Q,  J  .  Although  the  aggregative  model  of  section  4  is  in 
0  1*  rot  max 

principle  adequate  for  investigating  the  Influence  of  any  of  these  para¬ 
meters  on  the  performance  measures  defined  by  (4.7),  (4.8),  (4.9)  and  (4.10), 
we  shall  center  our  analysis  around  the  part  played  by  Jmax  and  Q.  And 
in  order  to  be  able  to  put  our  results  in  concrete  form,  we  shall  attribute  to 
each  of  the  other  parameters  a  fixed,  and  hopefully  representative  value,  namely: 

Cq  °  48  page  frames, 

=■  4  sectors, 

_3 

T  .  “  20.10  seconds. 


Likewise,  as  far  as  the  load  on  the  system  is  concerned,  we  shall 
restrict  ourselves  to  the  study  of  the  system  sensitivity  to  fluctuations 
of  the  number  N  of  active  user  terminals.  The  parameters  of  the  distribu¬ 
tion  functions  which  define  the  users'  and  their  programs'  stochastic  be¬ 
havior  are  assigned  values  inferred  from  statistical  observations.  At  the 
time  this  example  was  elaborated,  available  statistics  were  those  observed 
by  SDC,  [Fin66]  [To65],  on  the  Time-Sharing  system  Q-32,  [Sc64]  [Sc67]: 


-  The  total  average  processor  time  (cp) 
is  taken  equal  to  1.39  sec. 


consumed  per  user  program 


-  The  average  user  reaction  time  (X)  is  taken  equal  to  32  seconds. 


-  The  paging  activity  of  user  programs  in  the  course  of  their  exe¬ 
cution  is  given  in  figure  7.3.  It  is  based  on  a  page  size  of 
1024  words.  The  number  y  of  instructions  executed  between  any 
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tr/o  successive  references  r^r^  to  distinct  pages  not  already 
referenced  i8  plotted  against  n.  The  ex- 

ponential  coefficients  0n  are  given  the  values 

9n  "  <yn  X  5*10"6)’1*  w-l,2,...,19, 

where  5.10  6  sec.  is  the  average  total  execution  time  of  at  instruc¬ 
tion  in  the  T-S  Q-32  system.  No  more  than  20  distinct  pages  are 
supposed  to  be  referred  to  during  a  program  execution  so  that 
920"0, 

Remark.  The  branch  of  hyperbola  in  figure  7.3  is  representative  of  the 
behavior  of  programs  whose  page  references  would  be  uniformly  distributed 
over  20  pages.  The  probability  of  such  programs  making  a  reference  to  a 
page  not  yet  referenced  decreases  linearly  with  n,  the  number  of  distinct 
pages  already  referenced.  Obviously  this  probability  decreases  much  more 
rapidly  than  linearly  for  the  program  executions  actually  observed.  This 
gives  partial  evidence  that  these  programs  enjoy  the  property  of  locality , 
vis.  they  favor  a  subset  of  their  pages  at  each  instant  of  their  execution. 
Such  a  program  behavior  has  been  discussed  in  [Be66]  [Va67]  [Be68]  [Dd68/l] 
(be70 ] .  The  values  which  have  been  assigned  to  9n,  n-1,2,...  are  therefore 
representative  of  this  behavior. 

7.5  The  Page  Demand  Rate 

The  page  demand  rate  is  the  mean  number  of  page  transfers  from  to 
M0  re<luested  Per  unit  by  a  program  in  running  state.  In  the  evalua¬ 
tion  of  this  rate  two  cases  must  be  distinguished  depending  on  whether 
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more  than  J  requests  for  program  execution  are  pending  or  not. 

7.5.1.  In  the  first  case  the  Jmax  programs  which  are  multiprogramroed 
lose  all  their  pages  in  MQ  when  the  execution  interval  has  run  out.  The 
minimum  number  of  page  transfers  requested  over  an  execution  interval  is 
therefore  equal  to  the  number  of  distinct  pages  referred  to  during  this 
interval.  This  number  is  augmented  whenever  the  number  of  distinct  pages 
referred  to  exceeds  K(Jmax)  so  that  pages  have  to  superimpose  each  other. 

Following  the  approach  taken  in  [Sm67],  the  average  number  a(Jmax»  M-> 
of  page  transfers  per  execution  interval  may  be  obtained  as  the  average 
number  of  events  of  a  non-homogeneous  Poisson  process  occurring  within  an 


exponentially  distributed  time  interval: 


0  K(Jmax)_1  e„  K(Jmax>"1  f 

^  =  1  +  VT  +>  ‘  *+  n^l  V*  +  n-1  V*  |  ^ 


K(Jmax)_1  z  6„  K(Jmax)"1  6  eK(Jmax> 


(7.1) 


a<J-^  -  1  +  £  jll  ^  +  A 


It  is  assumed  in  (7.1)  that  the  page  transfer  request  rate  remains 

constant  and  equal  to  6  (  }  after  page  superimposition  first  occurs,  viz. 

v  max  th 

after  the  occurrence  of  the  K(Jmax)  transfer  request. 

The  page  demand  rate  is  thus  equal  in  this  first  case  to  M  MJ^x’^* 

7.5.2.  In  the  second  case  the  J  programs,  1  s  J  s  Jmax*  which  are 
mu 1 1 ip rogr ammed  do  not  lose  at  the  end  of  each  execution  interval  the  pages 
they  have  accumulated  in  MQ.  On  the  assumption  that  in  this  case  programs 
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are  never  throughout  their  execution  deprived  of  their  K(J  )  page 

max 

frames  In  Mfl,  the  page  demand  rate  may  be  approximated  by  (jp  x  d(J  cp). 

max* 

This  assumption  will  be  Justified  by  the  conditions  for  near-decorapos- 
abillty  discussed  in  $7.8. 

7.5.3.  We  will  assume  that  the  probability  of  a  page  missing  in  Mfl 
within  some  interval  (t,t4h]  when  a  program  is  running t  depends  on  h  only 
and  not  on  t,  and  is  equal  to 


h  X  pi  X  S(Jjnax,|A)  +  0(h) 

or  h  X  cp  X  a(J,cp)  +  0(h),  1  *  J  £  j  , 

max 


according  to  the  number  of  requests  for  program  executions  which  are  pending. 

7.5.4.  Some  basic  properties  of  the  page  demand  rate  may  be  found  in 

expression  (7.1),  taking  into  account  that  coefficients  9r,  n-1,2 .  are 

representative  of  a  general  property  of  program  paging  activity.  Consider¬ 
ing  only  the  first  case  above  and  denoting  by  ^  the  page  demand 

max' 

rate  difference  which  would  result  from  an  allotment  of  an  additional  page 
frame  (KUmax)  -  K(‘JmtLX>  +  D  to  each  multiprogramraed  user  program,  we  have 


\(J  ) 

max 


K(J  ) 
max 

-  n 

n=l 


9 

_D_  /e 

9+U  )+l 

n  max 


9k(j  ))' 
max 


The  shape  of  the  paging  activity  displayed  in  figure  7.3  is,  on  the 
average,  such  that 


K(J  ), 
'  max" 


1  s: 


K(J  ) 
max' 


s:  20. 
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S  *  92  2  •••  2  eK(J  , 

max 


(7.3) 


and  the  stronger  the  locality  property  is,  the  greater  the  inequalities  are. 

^(•LaJ  is  thereby  negative  for  all  values  of  K(J  )  with  the  well  known 
consequence  that  the  page  demand  rate  ia  a  decreasing  function  of  the 
primary  memory  apace  guaranteed  to  each  multiprograrnmed  program.  The 
product 


K(J  )  A 
max'  0 


n-1  n 


(7.4) 


'decreases  as  K(Jmax)  increases.  This  decrease  is  more  than  exponential 

owing  to  inequalitites  (7.3).  l^/j  *|  decreases  even  faster  than  this 

‘'max'' 

product  when  K(Jmax)  increases,  strong  evidence  being  given  by  figure  7.3 

(and  other  studies,  see  e.g.  [Va67]),  that  the  decreasing  function  0  is 

n 

strictly  concave  since  for  its  inverse  y  : 

n 


Vn»yn  >  y 


where  y  is  a  positive  branch  of  hyperbola;  hence 


l02"eil*IVe2l*-*»leK(j  )+r0K(j  )!• 


Thus  the  page  demand  rate  increases  all  the  more  steeply  when  the 
main  memory  space  guaranteed  to  each  program  shrinks  as  this  space  allot - 
ment  is  small . 

The  product  (7.4)  and  thus  the  decrement  v  increase  also  as 

^  max^ 

M-  *♦  0,  this  influence  of  y,  growing  exponentially  with  K(J  )  . 

*  max 


7.12 


On  the  other  hand,  It  results  from  (7.1)  that  n  x  d(J  ,p.)  -*  g 

'max' 

as  "*  0  and  M  X  a(Jmax*M-)  "*  *a  n  -♦  ®,  where,  owing  to  inequalities 

(7.3),  ®k(J )  is  a  minimum  for  any  fixed  value  of  K(J  ). 

max  '  max 

So,  for  any  value  K(Jmax)  of  the  main  memory  allotment  the  page 

demand  rate  approaches  a  minimum  0  v  as  the  average  execution  interval 

'max'' 

is  increased  until  it  becomes  equal  to  the  average  processor  time  consumed 

per  user  programs .  When,  the  average  execution  interval  decreases  (m  “♦  ®) , 
the  page  demand  rate  finishes  by  growing  like  ji. 

7.5.5.  These  tendencies  of  the  page  demand  rate  in  function  of  K(J  ) 

max 

and  (p.)  are  illustrated  by  figures  7.4  and  7.5.  These  figures  reveal 
that  from  the  point  of  view  of  the  page  demand  rate,  there  would  not  be  much 
to  be  gained  from  guaranteeing  each  program  a  parachar  (*)  of  more  than 
about  seven  page  frames  in  and  an  execution  interval  longer  than  0.2 
seconds;  conversely  they  show  that  a  minimum  execution  Interval  of  0.05 
seconds  and  a  minimum  parachor  of  four  page  frames  are  mandatory.  It  will 
be  shown  in  ^  7 .9.2  that  these  parachor  values  which  are  satisfactory  for 
programs  considered  as  individuals  are  far  from  optimising  performance 
criteria  which  take  page  transfer  rates  into  account  .  what  is  optimal 
for  programs  considered  as  individuals  will  not  appear  optimal  when  they 
are  considered  collectively. 


* 

Parachor  is  a  term  in  use  to  designate  the  amount  of  primary  memory  needed 
by  a  program  to  achieve  some  "satisfactory"  level  of  performance  [Be68]  . 


7.6  The  Rate  of  Page  Transfer  Completions 


Let  J,  1SJS  J  ,  be  the  number  of  programs  being  multiprogrammed 
at  some  time  t  and  i',  0  £  i[  *  J,  be  the  number  of  programs  being  in 
suspended  state  at  this  time  t.  It  is  shown  in  the  appendix  of  [Sm67] 
that  because  of  the  simplifying  assumptions  7.3.1  and  7.3.2,  the  rate  at 
which  page  transfers  from  Mx  to  MQ  are  completed  may  be  taken  equal  to 


21i 


Cj+2i'-i 


(7.5) 


rot 


•This  rate  is  obviously  null  for  t[  -  0,  and  approaches  asymptotically 


as  i'  increases  (see  figure  7.2), 
T  1 

rot 


7.7  Conditions  for  Aggregation, 

It  results  from  the  simplifying  assumption  made  in  ^7.3  and  7.5  that 
the  hypothetical  system  may  be  regarded  as  a  closed  multi-queue.?  system  in 
which  a  fixed  number  N  of  programs  cycle  between  three  stages  of:  service, 
i  ,  #=0,1,2,  being  the  number  of  programs  at  stage  £,  we  may  suppose  that 

z 

i  is  the  number  of  programs  in  ready  and/or  running  state; 

0 

1  where  is  the  number  of  P*'ograras  in  suspended  state 

and  i'l  the  number  of  programs  waiting  to  be  multiprogrammed; 

i  is  the  number  of  terminals  without  pending  request  for 

2 

program  execution. 

At  any  time  t,  t  >  0,  when  J  programs  are  being  multiprogrammed,  these 
variables  must  obey  the  following  relations: 
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12  -  N-djj+lJ+l”), 

(lr>°)  3  <10+1i  -  Jmax>’ 

(1r>o)  s  «"-V  >  W- 

Denoting  M-0*1*2*  the  transfer,  rate  from  stage  i  to  stage  J, 

j*.  we  have 


^0P01  “  1 

/  —  lV  >  0 

^cpXd(J,tp)  if  1'^  ■  0 

(7.6) 

(7.V) 

^0P02  " 

<P 

(7.8) 

^1P10  " 

l^i (1^)  (see  equation  (7.2)) 

(7.9) 

^2P21  " 

xxi2 

(7.10) 

All  other  transfer  rates  being  null,  the  condition  (5.1)  for  near-ccinplete-de- 
composablllty  amounts  in  this  case  to: 


For  i2  -  0,...,N-Jmax, 

Xxl,  «  min  (1*  ),0i.xd<J  a„»n))-cp] 

1  l£l'£J  1  1  max 

1  max 


and  for  " 


N-J  +1 . N-l 

max  ’  * 


xxi2  « 


min 

1*1 'J*J  „ 

1’  max 


[^(ip.tpxOCJ.cp)-!)]. 


Since  we  have  <p  £  and  tpxd(J,cp)  £  1*X5(J  wx»>t) » (J  s  Jmax^  as  weH  as 
for  a  <  b: 


<pxd(a,<p)  *  cpx9(b,cp)  and  ^(a)  *  ^(b). 


a  necessary  and  sufficient  condition  for  (7.11)  to  be  satisfied  Is 
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X(N-l)  «  minfp^U),  cpx(d(l,cp)-l) ], 
or,  replacing  pt^(l)  by  its  value  yielded  by  (7.2): 

\(N-1)  «  min  [fcr-  X  “rf),  cpX(9(l,tp)-l)  ].  (7.12) 

rot  cl‘ri 

Condition  (7.12)  stipulates  that  the  lowest  possible  rate  of  page  trans¬ 
fers  between  primary  and  secondary  memory  be  higher  than  the  highest  possible 
rate  of  interaction  between  the  set  of  terminals  and  these  memories.  This 
is  in  fact  a  common  state  of  affairs  and  we  may  reasonably  expect  to  see 
this  condition  satisfied  by  most  paging  computer  systems. 

Moreover,  Condition  7.12  is  strictly  expressed  in  terms  of  the  hardware 
parameters  N,^,  T  ,  the  user  reaction  time  (X)“l,  and  the  characteristics 
of  a  non-mu l tiprograrmed  user  program  execution,  namely: 

(tp)  ,  the  average  processor  time  consumed  per  execution; 

&(1,<P>,  the  average  number  of  page  faults  per  execution  if  the 
entire  primary  memory  space  Cq  is  available;  relation 
(7.1)  completely  determines  9(1, tp)  in  function  of 
tp,  CQ,  and  the  user  program  activity  (§n),  n“l . 

A  prior  knowledge  of  these  hardware  and  load  parameters  therefore, makes 
the  verification  of  near-decomposability  possible  and  may  give  a  prior  esti¬ 
mation  of  the  precision  achievable  by  using  an  aggregative  model.  In  this 
particular  case,  given  the  values  assigned  in  tf»7.4  to  tp,  X,  Trot  and  c^ 
and  since  9(1, to)  ~  19,  (7.12)  is  satisfied  as  long  as 
N  «  1+X  1Xtpx(9(l,cp)-l)  ~  300. 


Under  this  condition 
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the  hypothetical  system  is  near ly-conipleceiy -decomposable  into  the  set  of 
user  terminals  on  the  one  hand,  and  a  primary-secondary  memory  aggregate 
on  the  other  hand.  The  short-run  equilibrium  attained  by  the  aggregate  may 
be  studied  Independently  of  the  Interactions  with  the  user  terminals;  the 
long  run  equilibrium  attained  by  these  Interactions  may  be  studied  In  ti*rms 
of  the  equilibrium  characteristics  of  the  aggregate. 

’/t 

7.8  Analysis  of  the  Aggregates  IRj (J),  SIR  (Jmax) 

Throughout  this  short-run  equilibrium  analysis  we  may  therefore  disre¬ 
gard  the  activity  of  the  user  teimlnals.  We  may  simply  assume  that  a  fixed 
number  (N-lj),  0  £  ij*  N-l,  of  requests  for  program  execution  are  pending. 
This  Implies  that  a  fixed  number  J  of  programs,  J  -  mln[(N-l?) ,j  ]  are 
being  permanently  multlprogrammed.  It  was  shown  In  ^7.5.1  and  7.5.2  that 
depending  on  whether  or  not  (N-i.)  >  J  ,  the  page  demand  rate  was  differ- 

ent.  For  this  reason  we  consider  two  types  of  aggregates:  A  saturated 

* 

aggregate,  denoted  51  (J  ),  In  which  the  page  demand  rate  Is  equal  to 

max  * 

and  anon-saturated  aggregate  OT(J) ,  J-1 . j  ,  in  which  the 

max 

page  demand  rate  Is  equal  to  <pxd(J,cp). 

Considering  this  latter  case  first,  a  probability  distribution 

TT2^n0^J^»  •  •  •  »*^|nax*  no  programs  being  In  ready  or  running 

state  on  condition  that  J  programs  are  being  multlprogrammed  may  be  obtained 
as  the  distribution  of  the  congestion  In  a  ^(J)  queueing  system,  whose  In¬ 
put  rate  Is  the  page  transfer  rate  and  whose  service  rate  Is  the  page  demand 
rate  cpxd(J,<p).  Since  the  page  transfer  rate  Is  dependent  on  the  congestion, 
probabilities  TT2(n0|j)  are  defined  by  relations  (5.3).  Introducing  Into 

(5.3)  the  rates  yielded  by  (7.9)  and  (7.7)  yields  for  J=»l . J  : 

*  max 


irraam  a  ■  mu  i 
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^(n0lJ) 


n  ^.(k) 

[<pXd(J,d]nO 


TTjlOlj), 


nQal  »•••!»! 


with  tTj (0 1 J)  p*  ( 


J-nO+l 

j  n  n,  (k) 

1  +  E  k=J  1 

nnal  [tpxa(j,(p)]ho 


i-i 


Next,  replacing  [cpXS(J,cp)]  by  [y,xd(J 

max 

* 

tlon  for  OT  (J  )  which  will  be  designated 
max 

In  particular, 


»M>)  ] »  yields 

tVnolJma*)> 


a  similar  dlstribu- 
J 

max 


ajU)  -  llt|(o|j),  J-l . Jmax, 

or  in  the  saturation  case 

cr?(J  )  ■  l-nt(0|j  ), 

is  the  probability  of  the  processor  not  being  idle  when  J  (or  J  programs 
are  being  multiprogramned.  This  probability  will  be  henceforth  referred  to 
as  the  processor  efficiency. 

•k 

In  figure  7.6,  a. (J  „  )  is  tabulated  for  1  £  J  £  20,  i.e., 

1  max  max  ’  ’ 

2  £  ^(J  x)  ^  48,  and  ^or  f°ur  values  of  the  average  execution  interval  [y,]  *. 


7.8.1  Optimal  Maximum  Degree  of  Multlprogrammine. 


Figure  7.6  reveals  that  distinct  optimal  maximum  degrees  of  multi¬ 
programming,  say  correspond  to  distinct  values  of  the  execution  inter- 

max 

opt  * 

val.  J  j,  maximizes  the  processor  efficiency  cr.  (J  ),  i.e.,  maximizes  the 
max  i  max 


stun 


J-n0+l 

T  n  n, (k) 

max  k“J 

j,  max _ 

"0=1 
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An  essential  property  of  near-dacomposablllty  is  that  the  relative 

values  of  the  short-run  equilibria  attained  in  each  system  , 

J  „  “1,2,...,  are  maintained  in  the  long-run.  Therefore.  J°Pt  is  also  the 
max  *  max 

maximum  degree  of  multiprogramming  which  maximises  the  long-run  fraction  of 
time  the  processor  is  busy  when  the  system  is  saturated. 

The  existence  and  the  value  of  this  optimum  depend  on  the  relative 
importance  of  two  antagonistic  effects:  as  the  degree  of  multiprogramming 
increases,  processor  efficiency 


(1)  tends  to  increase  the  probability  of  at  least  one  program 
being  in  read)*  state, 


(ii)  tends  to  decrease  owing  to  the  increase  of  the  page  fault 
rate  which  results  from  the  reduction  of  primary  memory 
space  allotted  to  each  program. 

A  method  to  evaluate  such  an  optimum  has  already  been  proposed  in 
[Wa69]  but  it  applies  to  the  simpler  case  of  identically  distributed 
channel  service  times  and  a  geometrically  distributed  number  of  page  faults 
per  program  execution;  thus  the  locality  property  of  programs  is  not  taken 
into  account. 

Figure  7.6  reveals  also  that  the  optimal  maximum  degree  of  multi¬ 
programming  inorea8ea  aa  the  average  execution  interval  [|i]  1  deoreaae8» 
Indeed,  the  second  counteracting  effect  does  not  become  operative  until 

programs  have  exhausted  their  primary  memory  allotment  K(J  ).  The  shorter 

ItlAX 

the  execution  interval,  the  smaller  this  exhaustible  space  allotment  becomes. 
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7.8.2.  Thrashing 

It  is  evident  from  figure  7.6  that  the  primary  memory  allotment 
which  must  be  guaranteed  to  each  program  in  order  to  achieve  maximal  pro¬ 
cessor  efficiency  is  considerably  larger  than  the  parachor  value  evaluated 
in  ^7.5.5. 

The  difference  is  justified  by  the  fact  that  the  parachor  is  an 

* 

individual  program  characteristic  whereas  o^(J)  and  are  measures 

of  the  overlapping  achieved  between  page  transfers  and  processor  busy 
periods  and  depend  on  secondary  memory  transfer  rates  as  well. 

P.  J.  Denning '9  argumentation  on  trashing  [De68|2]  helps  us  to 
understand  why  the  parachcr  is  smaller  than  K(J°*>t).  A  measure  e  of  the 
ability  of  a  program  to  use  the  processor  may  be  defined  as 


where 


-1 


+  a(Jmax*‘i)  X  (ill)_1 


J  -1 
max 


VJW  *VJmax"V 


V° 


is  the  average  service  rate  of  page  transfer  requests;  [d(J  „  .p.)  X  (pu)  ] 

max  1 

is  thus  the  average  time  a  program  spends  waiting  for  page  transfers  during 
an  execution  interval.  Denoting  by  m  the  page  transfer  rate,  we  obtain 


1  +  m  X 


(i^r1 


The  slope  of  e  in  function  of  m  is 
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de  (ji, ) 

T"  "  _  1  » 

diQ  i  2 

[1  +  m  X  (nj^)  ] 

which  means  that  e  is  more  sensitive  to  m  variations  as  is  great  and 

m  small.  Therefore,  the  larger  (y^)  1  is,  the  larger  K(jJ^)  and  the  smaller 

jopt.  Urge  (J.)’1  values  are  responsible  for  K(J°P|;)  exceeding  the  parachcr. 
max  1  _  _i 

This  extreme  sensitivity  of  processor  efficiency  for  large  (p^) 

to  m  variations  is  responsible  for  serious  degradations  of  the  system  re¬ 
opt 

sponse  time  when  the  degree  of  multiprogransning  is  allowed  to  exceed  J^. 

These  degradations  are  analyzed  in  paragraphs  7.12  and  7.13. 

7.8.3.  Processor  Efficiency  Versus  Execution  Intervals 

Figure  7.6  shows  that  the  longer  the  mean  execution  interval  is, 
the  smaller  the  primary  memory  space  K(Jmax>  required  to  attain  a  specified 
processor  efficiency  is.  Or,  that  the  larger  KCJ^)  is,  the  more  advantageous 
it  is  from  the  point  of  view  of  processor  efficiency  to  provide  for  the  long¬ 
est  possible  execution  intervals.  This  is  a  consequence  of  the  increasingly 

preponderant  influence  (ji)"1  has  on  the  page  fault  rate  decrement  y 

r  max 

(cf.  ^7.5.4)  as  K(J  )  increases,  owing  to  the  property  of  locality  of 
references  (inequalities  7.3). 

Therefore  a  policy  which  optimizes  processor  efficiency  by  taking 
advantage  of  the  locality  property,  should  provide  the  user  program  with 
the  longest  possible  execution  intervals.  Such  a  policy  gives  most  effec¬ 
tive  results  when  the  primary  memory  allotment  per  program  is  large,  viz. 
the  degree  of  multiprogramming  small.  This  is  a  supplementary  argument  in 


favor  of  processor  allocation  methods  which  have  already  been  advocated 
[Mu70]  to  minimize  changes  of  tasks  (and  thereby  reduce  overheads  on 
processors)  when  the  system  is  underloaded,  viz.  when  the  number  of  out¬ 
standing  requests  for  program  execution  is  less  than  the  number  of  quanta 

contained  within  the  prerequisite  response  time. 

One  may  also  observe  in  figure  7.6  that  a1(J|nax)  is  Practically 
insensitive  to  J  if  (p.)"1  is  so  small,  in  this  example  less  than  about 

lUaX 

£  e^1  m  430  ns,  so  that  programs  are  not  given  enough  time,  whatever 

KfJ  )  is,  to  accumulate  in  M  the  minimum  number  of  pages  necessary  for 
v  max  u 

the  property  of  locality  to  have  effective  results  (about  four  pages  as 
•shown  in  figure  7  .4) . 

7.9  The  Long-Run  Equilibrium 

The  entire  system  may  now  be  regarded  as  a  set  of  N  active  user  terminals 

originating  requests  for  program  execution  to  the  aggregates  ^(j) ,  J=1»--*>Jmax’ 

and  W*(J  )•  Let  n.  -  (N-i„)  be  the  number  of  pending  requests,  i.e.,  the 
1  max  i 

number  of  programs  being  or  waiting  to  be  multiprogrammed.  We  will  refer 

to  n^  as  being  the  system  congestion. 

We  may  consider  that  the  programs  are  executed  by  a  single  aggregate 

Slt,^)  such  that 

!Dl1(n1)  =  OTjU),  ^  ^  =  J  *  Jmax» 

®i<ni>  for  ni  >  w 

The  rate  at  which  programs  are  completed  by  Wi(n1>  is 
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Vni)’ 


if  n.  SJ  J 
1  max 


^[l-TTjCOlrij)]  -  q>xcr^(iij) , 
W[1-^<°lJmax)]  -  ^l(Jmax)  t£  nl  >  J 


(7.13) 


max 


♦  o(ni>  will  he  referred  to  as  the  system  service  rate .  ^  2^nl^  £s 

displayed  in  figure  7.8. 

The  rate  at  which  requests  are  originated  at  some  time  t  when  the  sys¬ 
tem  congestion  is  equal  to  n^  is  equal  to 

^2^21  "  X(N-n^) ,  nj*0 . N  (7.14) 


Introducing  (7.13)  and  (7.14)  into  (4.7.1)  yields  the  long-run  equili¬ 
brium  probability  distribution  {^(n^N)}”  of  the  system  congestion: 


T^OajN)  -  X 


n1  N(N-l)...(N-n1+l) 


n. 


f  n«*l j • • • |N 


n  *!  2oo 

fc*l  * 


where  tt^  ( 0 1 N)  ■/ 


N  n.  N(N-l) . . . (N-n.+l) 
1  +  £  X  - - - 

ni 

n  *1  1 

1  n  t,  200 

k=l  ’ 


-1 


7.10  The  System  Congestion  Avalanche -Like  Effect 

Assuming  N=20 ,  the  distribution  {^(n^N)}”  has  been  calculated  for 

various  values  of  J  (see  figure  7.7).  Some  representative  values  of  the 

max 

mean  congestion 

N 

E-  £  nx  TT^njN) 


among  those  obtained  are: 
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J 

max 

E 

1 

12 

2 

7.49 

3 

5.25 

4 

5.71 

10 

15.7 

20 

19.1 

These  mean  congestions,  as  well  as  the  distributions  of  figure  7.7 
suffice  to  advocate  multiprogramning  towards  'batch-processing'  (Jmax=1) 
and  to  advocate  policies  which  control  the  maximum  degree  of  multiprogram¬ 
ming  so  as  to  keep  it  around  its  optimal  value  ~  3)  • 

In  particular,  the  last  mean  congestion  in  the  list  above,  almost  equal 
to  N=20,  indicates  how  the  system  may  become  overcrowded  if  all  programs 
present  are  allowed  to  compete  for  primary  memory  An  Intuitive 

interpretation  of  this  behavior  may  be  deduced  from  the  shape  of  as 

a  function  of  n,  for  some  given  value  of  J  (cfr.  figure  7.8):  once 
1  max 

the  congestion  exceeds  an  optimal  value  which  maximizes  the  system  service 

rate  ty.  _(n.),  any  increase  of  the  congestion  results  in  a  service  rate 

decrease  which ,  in  turn ,  accentuates  the  initial  congestion  increase. 

The  upper  limit  J  of  the  degree  of  multiprogramming  acts  as  a  barrier 
rr  max 

against  this  avalanche-like  effect:  the  system  service  rate  never  decreases 
below  cpXa^(Jmax) ,  no  matter  how  great  the  congestion  becomes. 

This  avalanche-like  behavior  of  the  congestion  is  analyzed  in  greater 


detail  in  section  7.12. 
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7.11  The  System  Saturation  Points 

In  figure  7.9,  the  mean  system  congestion  E  has  been  plotted  against 
the  number  N  of  active  user  consoles (1  S  N  S  20)  for  various  values  of  J  . 

As  N  increases,  the  mean  congestion  approaches  and  follows  asymptotes 
of  slope  1.  This  is  due  to  the  fact  that,  for  increasing  values  of  N,  all 
other  system  parameters  remaining  constant: 

tTT^nJx)  -  0)“  ; 

the  system  therefore  tends  to  behave  as  a  N-consoles  system  with  constant 

service  rate  [coxa  (J  )].  Kleinrock  [17]  has  shown  how,  beyond  a  certain 
x  max 

N  value,  such  systems  get  saturated,  each  additional  user  interfering  com¬ 
pletely  with  all  the  other  users  and  adding  one  more  unit  to  the  mean  con¬ 
gestion.  He  has  shown  that  the  corresponding  mean  congestion  asymptote 
of  slope  1  intersects  the  line  E=»l  for  a  certain  value  of  N  which  is  pre¬ 
cisely  the  average  number  of  consoles  the  system  may  handle  without  ex¬ 
periencing  any  mutual  delaying  interference  among  the  user  requests;  this 
number,  which  he  defines  as  the  saturation  point  of  the  system,  is  equal 
to  the  average  number  of  user  requests  which  may  be  satisfied  until  comple¬ 
tion  within  a  time  interval  equal  to  the  sum  of  the  average  user  reaction  time 
and  the  average  user  request  service  time. 

We  may  well  verify  that,  in  figure  7.9,  the  asymptotes  intersect  the 
line  E=1  at  point 

[tpXff*  ( JmaX)  ]  _1  +  X_1 
tcPXCTl(Jmax)]’1 


* 

max 


(7.15) 
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which  would  be  "the  saturation  point"  of  the  system  if  its  service  rate 
were  not  congestion-dependent  but  equal  to  On.  the  other 

hand,  the  behavior  of  the  mean  congestion  in  the  domain  of  smaller  N 
values  is  considerably  different  from  that  of  a  system  with  congestion- 
independent  service  rate.  In  these  latter  systems  the  mean  congestion 
cannot  increase  more  than  linearly  with  N.  The  causes  of  the  sharp  non- 
linearities  which  may  be  observed  in  figur,  7.8  are  discussed  in  the  next 
section  where  a  definition  of  saturation  for  systems  with  a  congestion- 
dependent  service  rate  of  the  same  shape  as  is  introduced* 

7.12  The  Congestion  Stability; 

A  peculiarity  of  the  aggregate  is  that  both  its  input  rate 

XCN-n^  at  which  requests  for  program  executions  are  originated  and  its 
service  rate  ^  ^n^  at  which  these  executions  are  completed  are  depen¬ 
dent  on  the  system  congestion  n^  nj*0,...,N.  The  general  shape  of  these 
rates  is  displayed  in  function  of  ^  on  figure  7.10. 

One  observes  that,  depending  on  the  relative  values  of  N,  cp, 

0  (n  ),  there  may  be  at  most  three  congestion  values  nfi,  nc  for  which 
the  input  equates  the  service  rate,  viz.  which  are  solutions  of 

X(N-nx)  -  ^1>2(lV»  (7.16) 

where  ^  2(nt)  is  given  by  (7.13). 

It  is  simple  to  prove  that  nA  and  are  stable  congestions  around  which 
the  system  is  inclined  to  come  into  equilibrium.  Let  us  consider  the  system 
at  some  instant  when  the  congestion  is  equal  to  n^.  An  increment 
of  the  congestion  will  cause  the  service  rate  to  exceed  the  input  rate. 
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This  excess,  which  is  greater  to  the  extent  that  Anj  is  large  (provided 

n  +  An  £  n  ),  will  tend  to  reduce  the  congestion  to  its  original  value 
A  1  opt 

n  .  Likewise  a  decrement  -An.  would  cause  the  input  rate  to  exceed  the 
A  1 

output  rate,  compelling  the  congestion  to  re-increase.  The  same  reasoning 
applies  to  the  congestion  n^. 

Inversely,  a  similar  argumentation  indicates  that  in  the  vicinity  of 

n  the  congestion  variations  are  reinforced  instead  of  being  deadened  by 
B 

the  alterations  they  cause  to  the  output  to  input  rate  ratio  .  is  an 
unstable  congestion ,  i.e.  a  state  the  system  will  always  be  zealous  to  leave 
in  favor  of  stable  congestions  fa  the  vicinity  of  n^  or  nc« 

Depending  on  the  relative  values  of  the  various  sytem  parameters,  inter¬ 
section  B  may  or  may  not  exist.  In  the  first  case  the  congestion  will  be¬ 
come  preferentially  steady  around  values  near  or  nc«  This  is  well  re- 

N 

fleeted  by  the  shape  of  the  long-run  equilibrium  distribution  [t^  (ni | N)  ^-o * 
figure  7.7.  Comparing  figure  7.7  and  7.8,  it  may  be  verified  for 
that  the  two  congestions  of  highest  probability,  ^  ~  1  and  ^  ~6,  are 
respectively  equal  to  nA  and  nc,  and  that  the  congestion  of  least  prob¬ 
ability,  viz.  nt  ~  3,  is  identical  to  n0.  Similar  extremes,  too  small  to  be 
distinguishable  on  figure  7.7,  affect  the  distributions  corresponding  to 
j  -  1  10  20.  It  may  be  conjectured  that  the  transient  time  behavior  of 
the  congestion  in  such  systems  is  increasing  oscillation  between  values  in 
the  vicinity  of  nA  and  values  in  the  vicinity  of  nc. 

If  no  B  intersection  exists,  ^(“ll^^n^O  exhibits  a  single 

maximum  equal  either  to  nA  or  nc,  depending  on  the  value  of  N.  Compare  for 
instance  figure  7.8  with  the  distributions  displayed  on  figure  7.11  for 
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J  =3  and  N=25,14  respectively, 
max 

The  behavior  of  the  congestion  as  a  function  of  N,  0  £  N  s'  “,  may  be 

summed  up  as  follows.  For  N  small,  the  congestion  at  first  remains  in 

the  vicinity  of  nA.  As  soon  as  N  exceeds  a  certain  value,  denoted 

A  J 

max 

and  defined  graphically  in  figure  7.10,  B  and  C  intersections  exist  and  the 

mean  congestion  must  reach  abruptly  a  value  between  n^  and  n^,  n^  »  n^. 

Thin  explains  the  sharp  increase  of  E  on  figure  7.9,  once  N  exceeds  a  value 

t 

which  is  equal  to  .  For  N  larger,  the  system  tends  to  behave  as  a 

max 

system  with  constant  service  rate  and  the  congestion  increases  almost 

it 

linearly  with  N  as  does  n  .  The  smaller  cr-  (J  )  is,  the  closer  to  N  is 

C  1  infix 

•n  ,  obviously.  For  N  -*  00 ,  it  is  easy  to  show  that,  for  a  given  value  of 

L* 

J  ,  E  -*  n„.  In  the  steady  state  we  may  equate  the  mean  input  and  service 
max  Ci 

rates;  the  mean  input  rate  is 


N 

X  E  TT^njN)  x  (N-nj)  =  X(N-E); 


N 

Equating  with  the  mean  service  rate  E  (nj N)i)ij  2^nl^*  we 

v1 


N  ^  2<ni) 

E  =  N  -  E  rr2(n1|N)-:j“^ -  .  (7.17) 

ni=1 


For  N  -♦  ®,  the  system  tends  to  behave  as  a  system  of  constant  service 
rate  ^  2(nc)  30  that 


tyi  2^n(p 

lim  E  =  N  -  [1  -  rr,(0|N)  ]  — — 

N-»co  i  K 


(7.18) 
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As  nc  is  solution  of  equation  (7.16),  and  ^ CO | N)  -*  0  as  K  -*  ®,  (7.18) 
becomes 

lim  E  ■  n  . 

N-*oo  c 


Nj  is  the  saturation  point  of  the  system:  the  mean  congestion  in¬ 
max 

creases  less  than  linearly  with  N,  for  N  <  Nj  ,  and  more  than  linearly 
I  max 

for  Nj  <  N  <  °>.  Nj  may  easily  be  deduced  for  given  values  of  the 
max  max 

parameters  which  define  the  hypothetical  system  by  constructing  an  equili¬ 
brium  diagram  of  the  type  defined  in  figure  7.10. 


7.13  The  System  Response  Time 

The  mean  response  time  of  the  system  is  given  by  relation  (4.70): 
w  -  [X(l  -  TT2(N|n]_1  X  E. 

It  is  the  mean  time  spent  by  a  program  in  the  system,  waiting  for  or 

being  multiprogrammed.  W  is  plotted  in  function  of  N  for  J  -3,10,20,  in 

max 

figure  7.12.  Sharp  non-linearities  may  be  observed  as  soon  as  N  exceeds 

the  saturation  point  Nj  .  For  N  larger,  the  system  tends  to  behave  as 

max 

if  the  service  rate  were  congestion-independent  so  that  the  limit  of  the 
slope  of  W  with  N  is 


lim  dW 
dN 


* 

[CPXCT,  (J 


max 


)]' 


the  smaller  a. ( J  ),  the  steeper  this  boundary  slope  is. 

x  max 

These  sharp  increases  of  the  congestion  and  of  the  response  time  when 
the  number  of  active  terminals  exceeds  the  saturation  point  are  the  conse¬ 
quences  of  the  extreme  sensitivity  of  the  processor  efficiency  (see  section 
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7.92  on  trashing)  to  fluctuations  of  the  page  demand  rate  when  the  page 

transfer  rate  is  slow  and  the  degree  of  multiprogramming  is  allowed  to 

exceed  . 

max 

The  fluctuations  of  W  in  function  of  J  for  N  fixed  are  displayed 

max 

* 

in  figure  7.13.  The  discontinuities  for  J  =7,13  correspond  to  a, (J  a  ) 

°  max  l  max 

minimae (cfr.  figure  7.6). 

By  virtue  of  the  property  of  near-decomposability  we  have  been  able  to 
specify  within  a  single  analytical  model  the  condition  of  existence  of 
thrashing  in  terms  of  parameters  defining  (i)  the  traffic  of  pages  between 
primary  and  secondary  memory  and  (ii)  the  load  on  the  system,  and  to  measure 
the  consequences  of  this  phenomenon  on  overall  system  performances  such 
as  the  congestion  and  the  response  time. 


7.14  Long-Run  and  Unconditional  Distributions 

J  J  N-J 

The  distributions  ’  W^P^i™=0  and  may 

0  1  jq  1 

be  obtained  from  the  system  congestion  {/r^CnjjN)^  _g  by  means  fo  relations 
(4.8).  More  precisely,  the  long-run  and  unconditional  probability  of  iQ 
programs  in  ready  or  running  state  is 


max 


W 


E  n2(n1|N)TT1(i0|nl) 


"l-10 


+  VlolJMx>  7  . ,  "2<nllN)- 


n.=J  +1 
1  max 


in  0,...,J  y 

0  max 


The  long-run  and  unconditional  probability  of  i|  programs  in  suspended  state  i 
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^max 

8i(iP  a  E  ,  TT2(nilN)TTi(ni-iilN) 

V1! 

VW1 


The  long-run  and  unconditional  probability  of  ij  programs  waiting  to  be 
multiprogrammed  is  simply  ( J  a-x+^  1 1 ’  ^l"®’ *  *  *  ,N"^ma  * 


Has!)  1 


□  o 

*-  7*  -E 


£  *i!  3  -I  J  i?  i 

-  ^  —  -n  ,  1  ^ 


972100 


liXd(J  ,ll) 
mpx  p 


iOOOO 

9 

« 

7 

6 

5 


<0 , 
<o  s 


,000 


set 


X 


°1  (Jmax  ) 


U"1  =400.10’6  **c 


Ht'i  ion 


972100 
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